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NOVEL TUMOR SUPPRESSOR GENE, HIC-1 

This invention was made with government support under Grant No. R01-CA43318, 
from the National Cancer Institute. The government has certain rights in the 
invention. 

5 BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

This invention relates generally to gene expression in normal and neoplastic cells, and 
specifically to a novel tumor suppressor gene, HIC-l, and its gene product 

2 Description of Related A rt 

10 Advances in recombinant DNA technology have led to the discovery of normal 
cellular genes such as proto-oncogenes and tumor suppressor genes, which control 
growth, development, and differentiation Under certain circumstances, regulation of 
these genes is altered and they cause normal cells to assume neoplastic growth 
behavior There are over 40 known proto-oncogenes and tumor suppressor genes to 

1 5 date, which fall into various categories depending on their functional characteristics 
These include, (1) growth factors and growth factor receptors, (2) messengers of 
intracellular signal transduction pathways, for example, between the cytoplasm and 
the nucleus, and (3) regulatory proteins which influence gene expression and DNA 
replication {e.g , transcription factors). 

20 Chromosome 17p is frequently altered in human cancers, and allelic losses often 
coincide with mutations in the p53 gene at 17pl3.1 (Vogelstein, B., et al.. Cell, 
70:523, 1992). This gene is one of the most frequently altered tumor suppressor genes 
in human neoplasms. However, in some tumor types, 17p allelic loss occurs at a high 
frequency in regions distal to p53 and in the absence of p53 mutations For instance, 

25 60% of breast cancers lose 17p alleles while only 30% of these tumors contain p53 
mutations (Chen, L-C, etai. Proc. Natl. Acad. Sci. USA, 88:3847, 1991; Takita, K., 
etal.. Cancer Res , 52:3914, 1992; Deng, G . etal. Cancer Res , ^ ^99, 1994; Com- 
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elis, R.S., etal.. Cancer Res., 544200, 1994). Furthermore, in one study of breast 
cancer, the independent loss of 17pl3 3 alleles was accompanied by increased levels 
of p53 mRNA. 

Human cancer cells typically contain somatically altered genomes, characterized by 
mutation, amplification, or deletion of critical genes. In addition, the DNA template 
from human cancer cells often displays somatic changes in DNA methylation (E.R. 
Fearon, et ai. Cell, 61 759, 1990; P A. Jones, et al.. Cancer Res., 4^:461, 1986; R. 
Holliday, Science, 238 163, 1987; A. De Bustros, etai, Proc. Natl. Acad. Sci., USA, 
81 5693, 1988); PA Jones, etai, Adv. Cancer Res., 54: 1, 1990; SB. Baylin, etai, 
Cancer Cells, r.3S3, 1991; M. Makos, etal.. Proc. Natl. Acad Set.. USA, §9:1929, 
1992; N Ohtani-Fujita, etal. Oncogene, 8:1063, 1993). However, the precise role 
of abnormal DNA methylation in human tumorigenesis has not been established 
DNA methylases transfer methyl groups from the universal methyl donor S-adenosyl 
methionine to specific sites on the DNA. Several biological fimctions have been 
attributed to the methylated bases in DNA. The most established biological fiinction 
is the protection of the DNA from digestion by cognate restriction enzymes The 
restriction modification phenomenon has, so far, been observed only in bacteria 
Mammalian cells, however, possess a different methylase that exclusively methylates 
cytosine residues on the DNA, that are 5' neighbors of guanine (CpG) This 
methylation has been shown by several lines of evidence to play a role in gene 
activity, cell differentiation, tumorigenesis, X-chromosome inactivation, genomic 
imprinting and other major biological processes (Razin, A., H., and Riggs, R.D. eds 
in DNA Methylation Biochemistry and Biological Significance, Springer- Verlag, 
New York, 1984). 

A CpG rich region, or "CpG island", has recently been identified at 17pl3.3, which 
is aberrantly hypermethylated in multiple common types of human cancers (Makos, 
M., etal.. Proc. Natl Acad Sci. USA, S9:1929, 1992; Makos, M., etal.. Cancer Res . 
53:2715, 1993; Makos. M., et al. Cancer Res. 53:2719, 1993). This 
hypermethylation coincides with timing and frequency of 17p losses and p53 
mutations in brain, colon, and renal cancers. Silenced gene transcription associated 



wo 96/14877 



-3- 



PCTAJS95/14996 



with hypermethylation of the normally unmethylated promoter region CpG islands has 
been implicated as an alternative mechanism to mutations of coding regions for 
inactivation of tumor suppressor genes (Baylin, S B., ei ai. Cancer Cells, 3:383, 
1991; Jones, P A. and Buckley, J.D.,^rfv. Cancer Res , SA A-l^, 1990). This change 
5 has now been associated with the loss of expression of VHL, a renal cancer tumor 
suppressor gene on 3p (J G. Herman, et al., Proc. Natl. Acad Sci. USA, 91:9700- 
9704, 1994), the estrogen receptor gene on 6q (Ottaviano, Y.L., et al. Cancer Res., 
54:2552, 1994) and the H19 gene on 1 Ip (Steenman, M.J.C., et ai. Nature Genetics, 
2:433, 1994). 

1 0 For several human tumor types, a second tumor suppressor gene may reside distal to, 
and be interactive with, the p53 gene at chromosome 17pl3 1 There is a need to 
identify tumor suppressor genes in order to develop the appropriate methodologies for 
increasing or decreasing their expression in cells where aberrant expression is 
observed Through characterization of a 17pl3.3 CpG island which is aberrantly 

15 hypermethylated in multiple common human tumor types, the present invention 
provides such a gene HIC-1 (hypermethylated in cancer) is a novel zinc finger 
transcription factor gene which is ubiquitously expressed in normal tissues, but under- 
expressed in tumor cells {e g , breast, lung, colon, fibroblasts) where it is hyper- 
methylated. A p53 binding site is located in the 5' flanking region of HIC-1 Over- 

20 expression of a wild-type p53 gene in colon cancer cells containing only a mutant p53 
allele, results in 20-fold activation of HIC-1 expression. 

The present invention shows that many human cancers exhibit decreased HIC-1 
expression relative to their tissues of origin The limitation and failings of the prior 
art to provide meaningful markers which correlate with the presence of cell 
25 proliferative disorders, such as cancer, has created a need for markers which can be 
used diagnostically, prognostically. and therapeutically over the course of such 
disorders. The present invention fulfills such a need 
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SUMMARY OF THE INVENTION 

The present invention is based on the seminal discovery of a novel tumor suppressor 
gene. HIC-1 (hypermethylated in cancer), which is aberrantly hypermethylated in 
multiple common human tumor types. The invention provides a HIC-1 polypeptide 
as well as a polynucleotide sequence encoding the polypeptide and antibodies which 
bind to the polypeptide. 



In one embodiment, the present invemion provides a diagnostic method for detecting 
a cell proliferative disorder associated with HIC-1 in a tissue of a subject, comprising 
contacting a target ceUular component comaining HIC- 1 with a reagent which detects 
HIC-1 Such ceUular components include nucleic acid and protein. 

In another embodiment, the present invention provides a method of treating a cell 
proliferative disorder associated with HIC-1, comprising administering to a subject 
with the disorder, a therapeutically effective amount of reagem which modulates HIC- 
1 expression. For example, since HIC-1 associated disorders typically involve hyper- 
methylation of HIC- 1 polynucleotide sequence, a polynucleotide sequence which 
contains a non-methylatable nucleotide analog is utilized for treatment of a subject 



Further, the invention provides a method of gene therapy comprising introducing into 
cells of a host subject, an expression vector comprising a nucleotide sequence 
encoding HIC-1, in operable linkage with a promoter. 



20 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE lA is a diagram showing a map of an 1 1.0 kb region of cosmid C-13A 
which contains a 50 kb human DNA insert harboring the region of chromosome 
17pl3.3 previously shown to have hypermethylation in multiple human tumor types 
5 (Makos, M., et al, Proc. Natl. Acad. Sci. USA, 89: 1929, 1992; Makos, M., et al.. 
Cancer Res., 53:2715. 1993; Makos, M., etal.. Cancer Res 53:2719, 1993) The 
position of the YNZ22 probe, EcoRI (E) restriction site and the location of a series of 
cosmid subclones which were prepared to span the area are shown 

FIGURE IB is a schematic for the HIC-1 gene which was found to be encompassed 
1 0 within the region shown in FIGURE 1 A and for which the amino acid sequence is 
shown in FIGURE 2B. Shown are: potential p53 binding site; TATAA = the TATA 
box sequence 40 bp upstream from the transcription start site; 5' UTR = the 1st 
untranslated exon, ATG = the most 5' translation start site; ZIN (zinc finger N- 
terminus) = the 478bp exon encompassing the highly conserved region (FIGURE 2A) 
15 of the Zin domain subfemily of zinc finger transcription factors; rectangle with shaded 
bars represents the 2015 bp last exon of HIC-1 and each shaded bar represems one of 
the 5 zinc fingers (FIGURE 2B) clustered in this 3' region of the gene; TAG = 
translation stop site in the HIC-1 gene, AATAAA = polyadenylation signal site found 
835 bp from the translation stop site. 

20 FIGURE IC and SEQ ID NO: 1 and 2 show the nucleotide and deduced amino acid 
sequence of HIC- 1 . 

FIGURE 2A and SEQ ID NO:3 show the amino acid sequences of HIC-1 The HIC-1 
amino acid sequence is compared with the conserved N-terminus region of the other 
members of the Zin domain zinc finger family. In the parentheses, the numbers 
25 indicate the position of the conserved region relative to the translation start site of 
each gene. The darkest shading shows position of amino acids which are idemical for 
at least five of the 9 proteins and the lighter shading shows position of conservative 
amino acid differences between the family members. D = drosophila, M = murine; 
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H - human. The bracket of amino acids at the bottom represents an area in HIC- 1 not 
found at this position in the other family members. 

HGURE 2B shows the entire coding region of the HIC- 1 gene. The deduced amino 
acid sequence for the two coding exons of HIC- 1 , as defined by the sequence analyses 
and expression strategies outlined in the text, are shown. The 5 zinc fingers in the 3' 
half of the protein are shown by the shaded boxes. 

HGURE 3 shows a Northern analyses of HIC-1 gene expression S = spleen; The = 
thymus; P= prostate. Te = testis; O = ovary; SI = small intestine; B = peripheral 
blood cells. The band above the 4 4 kb marker co-hybridizes with ribosomal RNA. 
The ~1 . 1 kb band has not yet been identified but could be an alternate splice product 
since it was not detected with probes from the zinc finger or 3' untranslated regions 
of HIC-1 



FIGURE 4 A shows RNAse protection assays of HIC-1 gene expression in a variety 
of normal and neoplastic human tissues In all panels, the top asterisk marks the 

15 position of the undigested 360bp HIC-1 gene RNA probe which was derived fi-om the 
region containing the zinc fingers in cosmid subclone 600 (FIGURE lA) The 
protected HIC-1 fragment (300bp) is labeled HIC- 1 FIGURE 4 A compares 
expression in 10 ug of total RNA from 2 established culture lines of normal human 
fibroblasts (WI-38 and IMR-90) to the HT 1080 culture line of fibrosarcoma cells 

20 (Fibro-C), from 3 different samples of normal colon (Colon - N) to the colon 
carcinoma cell line, CaCO, (Colon-C), and from a sample of normal lung (Lung-N) 
to the established line of human small cell lung carcinoma, NCI-H209 (Lung-C) 
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FIGURE 4B shows the RNAse protection assay for 10 ug of RNA from 6 different 
established culture lines of breast carcinoma (lane I MDA231, lane 2 HS58T; lane 3 
MDA468; lane 4 T47D; lane 5 MCF7; lane 6 MDA453), each of which has extensive 
methylation of Not I sites of the HIC-1 CpG island. 

5 nOURE 4C shows the RNAse protection assay for 1 0 ug of RNA from normal fetal 
brain (B) compared to a series of non-cultured brain tumors (1 anaplastic astrocytoma 
(AA) and 8 more advanced glioblastomas (lanes 1-8). 

FIGURE 5 shows an RNAse protection assay, as detailed in FIGURE 4, after 
infection of an adenoviral vector containing either the p-galactosidase gene or the 
10 wild type human p53 gene into the SW480 line of human colon cancer cells. 
(Uninfected, normal, control human fibroblasts (F), uninfected SW480 cells (U), 
SW480 ceUs infected with the p-galactosidase gene (GAL), and SW480 cells infeaed 
with the p53 gene (p53)). Positions of the undigested HIC-1 and GAPDH probes and 
of the mC-l and GAPDH transcripts are marked exactly as in FIGURE 4. 

1 5 DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a novel tumor suppressor gene, HIC-l 
(hypermethylated in cancer). HIC-l is located on chromosome 17pl3 3, distal to the 
tumor suppressor gene. p53, at 17pl3.1, within a CpG island which is abnormally 
methylated in many different types of tumors. This abnormally methylated CpG 
20 island completely encompasses the coding region of HIC- 1 gene. 

In a first embodiment, the present invention provides a substantially pure HIC-1 
polypeptide consisting essentially of the amino acid sequence shown in FIGURE 2B 
and SEQ ID NO:3. HIC-1 polypeptide is characterized as having a distinct amino 
acid homology to a highly conserved N-terminal motif, termed the Zin (Zinc finger 
25 N-terminal) domain, which is present in each member of subset of zinc finger 
transcription factors. In addition, it also has five Kruppel type Cys^-Hisj zinc fingers 
characteristic of the 3' region of those same proteins 
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The term "substantially pure" as used herein refers to HIC-1 polypeptide which is 
substantially free of other proteins, lipids, carbohydrates or other materials with which 
it is naturally associated. One skilled in the art can purify HIC-1 using standard 
techniques for protein purification The substantially pure polypeptide will yield a 
single major band on a non-reducing polyacrylamide gel The purity of the HIC-1 
polypeptide can also be determined by amino-terminal amino acid sequence analysis 

The invention includes a functional polypeptide, HIC-1, and functional fragments 
thereof As used herein, the term "functional polypeptide" refers to a polypeptide 
which possesses a biological function or activity which is identified through a defined 
functional assay and which is associated with a particular biologic, morphologic, or 
phenotypic alteration in the cell Functional fragments of the HIC-1 polypeptide, 
include fragments of HIC-1 which retain the activity of e.g., tumor suppressor 
activity, of HIC-1. Smaller peptides containing the biological activity of HIC-1 are 
included in the invention The biological function, for example, can vary from a 
polypeptide fragment as small as an epitope to which an antibody molecule can bind 
to a large polypeptide which is capable of participating in the characteristic induction 
or programming of phenotypic changes within a cell. A "functional polynucleotide" 
denotes a polynucleotide which encodes a functional polypeptide as described herein 

Minor modifications of the HIC-1 primary amino acid sequence may result in proteins 
which have substantially equivalent activity as compared to the HIC-1 polypeptide 
described herein. Such modifications may be deliberate, as by site-directed 
mutagenesis, or may be spontaneous. All of the polypeptides produced by these 
modifications are included herein as long as the tumor suppressor activity of HIC-1 
is present. Further, deletion of one or more amino acids can also result in a 
modification of the structure of the resultant molecule without significantly altering 
its activity This can lead to the development of a smaller active molecule which 
would have broader utility. For example, it is possible to remove amino or carboxy 
terminal amino acids which may not be required for HIC-1 activity 



W0 96/14S77 PCr/US95/14996 

-9- 

The HIC-1 polypeptide of the invention also includes conservative variations of the 
polypeptide sequence. The term "conservative variation" as used herein denotes the 
replacement of an amino acid residue by another, biologically similar residue. 
Examples of conservative variations include the substitution of one hydrophobic 
5 residue such as isoleucine, valine, leucine or methionine for another, or the substitu- 
tion of one polar residue for another, such as the substitution of arginine for lysine, 
glutamic for aspartic acids, or glutamine for asparagine, and the like The term 
"conservative variation" also includes the use of a substituted amino acid in place of 
an unsubstituted parent amino acid provided that antibodies raised to the substituted 
10 polypeptide also immunoreact with the unsubstituted polypeptide. 

The invention also provides an isolated polynucleotide sequence consisting essentially 
of a polynucleotide sequence encoding a polypeptide having the amino acid sequence 
of SEQ ID NO; 3. The polynucleotide sequence of the invention also includes the 5* 
and 3' untranslated sequences and includes regulatory sequences, for example. The 

15 term "isolated" as used herein includes polynucleotides substantially free of other 
nucleic acids, proteins, lipids, carbohydrates or other materials with which it is 
naturally associated Polynucleotide sequences of the invention include DNA, cDNA 
and RNA sequences which encode HIC-1. It is understood that all polynucleotides 
encoding all or a portion of HIC-1 are also included herein, as long as they encode a 

20 polypeptide with HIC-1 activity. Such polynucleotides include naturally occurring, 
synthetic, and intentionally manipulated polynucleotides For example, HIC-1 
polynucleotide may be subjected to site-directed mutagenesis. The polynucleotide 
sequence for HIC-1 also includes antisense sequences The polynucleotides of the 
invention include sequences that are degenerate as a result of the genetic code There 

25 are 20 natural amino acids, most of which are specified by more than one codon. 

Therefore, all degenerate nucleotide sequences are included in the invention as long 
as the amino add sequence of HIC-1 polypeptide encoded by the nucleotide sequence 
is functionally unchanged. In addition, the invention also includes a polynucleotide 
consisting essentially of a polynucleotide sequence encoding a polypeptide having an 

30 amino acid sequence of SEQ ED NO 3 and having at least one epitope for an antibody 
immunoreactive with HIC-1 polypeptide 
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The polynucleotide encoding HIC-1 includes the nucleotide sequence in FIGURE IC 
(SEQ ID NO l and 2), as well as nucleic acid sequences complementary to that 
sequence. A complementary sequence may include an antisense nucleotide. When 
the sequence is RNA, the deoxynucleotides A, G, C, and T of FIGURE IC (SEQ ID 
5 NO: 1 and 2) are replaced by ribonucleotides A, G, C, and U, respectively Also 
included in the invention are fragments of the above-described nucleic acid sequences 
that are at least 15 bases in length, which is sufficient to permit the fragment to 
selectively hybridize to DNA that encodes the protein of FIGURE 2B (SEQ ID NO: 
3) under physiological conditions and under moderately stringent conditions. 

10 Specifically disclosed herein is a DNA sequence for HIC-1 which schematically is 
illustrated in FIGURES 1 A and IB (see also, FIGURE IC and SEQ ID NO: 2). The 
transcribed exon encompasses 5 zinc fingers and extends 359 bp from the last zinc 
fmger to the stop site The transcription proceeds 239 bp past the stop site, in an 
apparent 3' untranslated region (UTR). There is also a polyadenylation signal, 

1 5 AATAAA, at position 835 bp from the stop site In addition, after the Zin domain and 
before the zinc finger exons, there is a consensus splice donor and an acceptor site 
separated by an intron region. The complete coding region of HIC-1 is encompassed 
by two exons within the CpG rich 3 .0 kb region between Not I sites N3 and N7 

DNA sequences of the invention can be obtained by several methods. For example, 
20 the DNA can be isolated using hybridization techniques which are well known in the 
art. These include, but are not limited to: 1) hybridization of genomic or cDNA 
libraries with probes to detect homologous nucleotide sequences and 2) antibody 
screening of expression libraries to detect cloned DNA fragments with shared 
structural features. 

25 Preferably the HIC- 1 polynucleotide of the invention is derived from a mammalian 
organism, and most preferably from human. Screening procedures which rely on 
nucleic acid hybridization make it possible to isolate any gene sequence from any 
organism, provided the appropriate probe is available. Oligonucleotide probes, which 
correspond to a part of the sequence encoding the protein in question, can be 
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synthesized chemically. This requires that short, oligopeptide stretches of amino acid 
sequence must be known. The DNA sequence encoding the protein can be deduced 
from the genetic code, however, the degeneracy of the code must be taken into 
account. It is possible to perform a mixed addition reaction when the sequence is 
5 degenerate. This includes a heterogeneous mixture of denatured double- stranded 
DNA. For such screening, hybridization is preferably performed on either single- 
stranded DNA or denatured double-stranded DNA. Hybridization is particularly 
useful in the detection of cDNA clones derived from sources where an extremely low 
amount of mRNA sequences relating to the polypeptide of interest are present In 
1 0 other words, by using stringent hybridization conditions directed to avoid non-specific 
binding, it is possible, for example, to allow the autoradiographic visualization of a 
specific cDNA clone by the hybridization of the target DNA to that single probe in 
the mixture which is its complete complement (Wallace, et aL, NucL Acid Res,, 9:879, 
1981). 

1 5 The development of specific DNA sequences encoding HIC- 1 can also be obtained 
by: 1) isolation of double- stranded DNA sequences from the genomic DNA; 2) 
chemical manufacture of a DNA sequence to provide the necessary codons for the 
polypeptide of interest; and 3) in vitro synthesis of a double-stranded DNA sequence 
by reverse transcription of mRNA isolated from a eukaryotic donor cell. In the latter 

20 case, a double-stranded DNA complement of mRNA is eventually formed which is 
generally referred to as cDNA. 

Of the three above-noted methods for developing specific DNA sequences for use in 
recombinant procedures, the isolation of genomic DNA isolates is the least common. 
This is especially true when it is desirable to obtain the microbial expression of 
25 mammalian polypeptides due to the presence of introns. 

The synthesis of DNA sequences is frequently the method of choice when the entire 
sequence of amino acid residues of the desired polypeptide product is known When 
the entire sequence of amino acid residues of the desired polypeptide is not known, 
the direct synthesis of DNA sequences is not possible and the method of choice is the 
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synthesis of cDNA sequences Among the standard procedures for isolating cDNA 
sequences of interest is the formation of plasmid- or phage-carrying cDNA libraries 
which are derived from reverse transcription of mRNA which is abundant in donor 
cells that have a high level of gene expression. When used in combination with 
5 polymerase chain reaction technology, even rare expression products can be cloned 
In those cases where significant portions of the amino acid sequence of the 
polypeptide are known, the production of labeled single or double-stranded DNA or 
RNA probe sequences duplicating a sequence putatively present in the target cDNA 
may be employed in DNA/DNA hybridization procedures which are carried out on 
1 0 cloned copies of the cDNA which have been denatured into a single-stranded form 
(Jay, e/a/., Nuci Acid Res,, 11:2325, 1983). 

A cDNA expression library, such as lambda gtl 1, can be screened indirectly for HIC- 
1 peptides having at least one epitope, using antibodies specific for HIC-1. Such 
antibodies can be either polyclonally or monoclonally derived and used to detect 
1 5 expression product indicative of the presence of HIC-1 cDNA 

DNA sequences encoding HIC-1 can be expressed in vitro by DNA transfer into a 
suitable host cell. "Host cells" are cells in which a vector can be propagated and its 
DNA expressed. The term also includes any progeny of the subject host cell It is 
understood that all progeny may not be identical to the parental cell since there may 
20 be mutations that occur during replication However, such progeny are included when 
the term "host cell" is used. Methods of stable transfer, meaning that the foreign DNA 
is continuously maintained in the host, are known in the art 

In the present invention, the HIC-1 polynucleotide sequences may be inserted into a 
recombinant expression vector. The term "recombinant expression vector" refers to 
25 a plasmid, virus or other vehicle known in the art that has been manipulated by 
insertion or incorporation of the HIC-1 genetic sequences. Such expression vectors 
contain a promoter sequence which facilitates the efficient transcription of the inserted 
genetic sequence of the host. The expression vector typically contains an origin of 
replication, a promoter, as well as specific genes which allow phenotypic selection of 
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the transformed cells. Vectors suitable for use in the present invention include, but 
are not limited to the T7-based expression vector for expression in bacteria 
(Rosenberg, et al. Gene ,56:125, 1987), the pMSXND expression vector for 
expression in mammalian ceUs (Lee and Nathans, J. Biol. Chem., 263 3521, 1988) and 
5 baculovirus-derived vectors for expression in insect cells. The DNA segment can be 
present in the vector operably linked to regulatory elements, for example, a promoter 
(e.g., T7, metallothionein I, or polyhedrin promoters). 

Polynucleotide sequences encoding HIC-1 can be expressed in either prokaryotes or 
eukaryotes. Hosts can include microbial, yeast, insect and mammalian organisms 
10 Methods of expressing DNA sequences having eukaryotic or viral sequences in 
prokaryotes are weU known in the art. Biologically functional viral and plasmid DNA 
vectors capable of expression and replication in a host are known in the art. Such 
vectors are used to incorporate DNA sequences of the invention. 

Methods which are well known to those skilled in the art can be used to construct 
15 expression vectors containing the HIC-1 coding sequence and appropriate 
transcriptional/translational control signals These methods include in vitro 
recombinant DNA techniques, synthetic techniques, and in vivo recombination/genetic 
techniques See, for example, the techniques described in Maniatis, et al., 1989 
Molecular Cloning A Laboratory Manual, Cold Spring Harbor Laboratory, N.Y 

20 A variety of host-expression vector systems may be utilized to express the HIC-1 
coding sequence These include but are not limited to microorganisms such as 
bacteria transformed with recombinant bacteriophage DNA plasmid DNA or cosmid 
DNA expression vectors containing the HIC-1 coding sequence; yeast transformed 
with recombinant yeast expression vectors containing the HIC-1 coding sequence, 

25 plant cell systems infected with recombinant virus expression vectors {e.g., 
cauliflower mosaic vims, CaMV; tobacco mosaic virus. TMV) or transformed with 
recombinant plasmid expression vectors (e.g., Ti plasmid) containing the HIC-1 
coding sequence; insect cell systems infected with recombinant virus expression 
vectors (e.g.. baculovirus) containing the HIC-1 coding sequence; or animal cell 
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systems infected with recombinant virus expression vectors {e.g., retroviruses, 
adenovirus, vaccinia virus) containing the HIC-1 coding sequence, or transformed 
animal cell systems engineered for stable expression. Since HIC-1 has not been 
confirmed to contain carbohydrates, both bacterial expression systems as well as those 
that provide for translational and post-translational modifications may be used, e.g., 
mammalian, insect, yeast or plant expression systems. 

Depending on the host/vector system utilized, any of a number of suitable 
transcription and translation elements, including constitutive and inducible promoters, 
transcription enhancer elements, transcription terminators, etc may be used in the 
expression vector (see e.^.. Bitter, et ai. Methods in Enzymoiogy 153 :5 16-544, 1 987) 
For example, when cloning in bacterial systems, inducible promoters such as pL of 
bacteriophage y, plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like may be used. 
When cloning in mammalian cell systems, promoters derived from the genome of 
mammalian cells {e.g., metallothionein promoter) or from mammalian viruses {e.g., 
the retrovirus long terminal repeat; the adenovirus late promoter; the vaccinia virus 
7 5K promoter) may be used Promoters produced by recombinant DNA or synthetic 
techniques may also be used to provide for transcription of the inserted HIC-1 coding 
sequence In addition, the endogenous HIC-1 promoter may also be used to provide 
transcription machinery of HIC- 1 . 

In bacterial systems a number of expression vectors may be advantageously selected 
depending upon the use intended for the expressed For example, when large 
quantities of HIC-l are to be produced, vectors which direct the expression of high 
levels of ftision protein products that are readily purified may be desirable. Those 
which are engineered to contain a cleavage site to aid in recovering are preferred. 
Such vectors include but are not limited to the E. coli expression vector pUR278 
(Ruther, etal, EMBOJ. 2: 1791, 1983), in which the HIC-1 coding sequence may be 
ligated into the vector in frame with the lac Z coding region so that a hybrid -lac Z 
protein is produced; pIN vectors (Inouye & Inouye, Nucleic Acids Res., 13:3101- 
3109, 1985; Van Heeke & Schuster, J. Biol. Chem. 264:5503-5509, 1989); 
glutathione-S-transferase (GST) and the like 
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In yeast, a number of vectors containing constitutive or inducible promoters may be 
used For a review see. Current Protocols in Molecular Biology, Vol. 2, 1988, Ed 
Ausubel, etal., Greene Publish. Assoc. & Wiley Interscience, Ch. 13; Grant, et ai, 
1987, Expression and Secretion Vectors for Yeast, in Methods in Enzymology, Eds. 
5 Wu «fe Grossman, 31987, Acad. Press, N.Y., Vol 153, pp.516-544; Glover. 1986, 
DNA Cloning, Vol. n, IRL Press. Wash.. D.C., Ch. 3, and Bitter, 1987, Heterologous 
Gene Expression in Yeast, Methods in Enzymology, Eds Berger & Kimmel, Acad. 
Press, N.Y., Vol. 152, pp. 673-684; and The Molecular Biology of the Yeast 
Saccharomyces, 1982, Eds Strathem, etal.. Cold Spring Harbor Press, Vols. I and II. 
10 A constitutive yeast promoter such as ADH or LEU2 or an inducible promoter such 
as GAL may be used {Cloning in Yeast, Ch. 3, R. Rothstein In: DNA Cloning Vol. 1 1. 
A Practical Approach, Ed. DM Glover, 1986, IRL Press, Wash , D C.) Alternatively, 
vectors may be used which promote integration of foreign DNA sequences into the 
yeast chromosome. 

15 In cases where plant expression vectors are used, the expression of the HIC- 1 coding 
sequence may be driven by any of a number of promoters. For example, viral 
promoters such as the 35S RNA and 19S RNA promoters of CaMV (Brisson, et ai. 
Nature 210:5 1 1-5 14, 1984), or the coat protein promoter to TMV (Takamatsu, et al. , 
EMBO J 6:307-31 1, 1987) may be used; alternatively, plant promoters such as the 

20 small subunit of RUBISCO (Coruzzi, et al. , EMBO J 3 : 1 67 1 - 1 680, 1 984; Broglie, et 
al. Science 224:838-843, 1984); or heat shock promoters, e.g., soybean hspI7.5-E or 
hspl7.3-B (Guriey, et ai, Mol. Cell. Biol. 6:559-565, 1986) may be used These 
constructs can be introduced into plant cells using Ti plasmids, Ri plasmids, plant 
virus vectors, direct DNA transformation, microinjection, elearoporation, etc. For 

25 reviews of such techniques see, for example, Weissbach & Weissbach, 1988, Methods 
for Plant Molecular Biology, Academic Press, NY, Section VIII, pp. 421-463; and 
Grierson & Corey, 1988, Plant Molecular Biology, 2d Ed., Blackie, London, Ch 7-9 

An alternative expression system which could be used to express is an insect system 
In one such system, Autographa califomica nuclear polyhedrosis virus (AcNPV) is 
30 used as a vector to express foreign genes The virus grows in Spodoptera frugiperda 
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cells The HIC-1 coding sequence may be cloned into non-essential regions (for 
example the polyhedrin gene) of the virus and placed under control of an AcNPV 
promoter (for example the polyhedrin promoter). Successful insertion of the HIC-1 
coding sequence will result in inactivation of the polyhedrin gene and production of 
5 non-occluded recombinant virus (i.e., virus lacking the proteinaceous coat coded for 
by the polyhedrin gene). These recombinant viruses are then used to infect 
Spodoptera frugiperda cells in which the inserted gene is expressed (e.g., see Smith, 
e/a/ , 1983, y. VioL 46:584, U S. Smith, Patent No 4,215,051) 

Eukaryotic systems, and preferably manmialian expression systems, allow for proper 
10 post-translationai modifications of expressed manmialian proteins to occur 
Eukaryotic cells which possess the cellular machinery for proper processing of the 
primary transcript, glycosylation, phosphorylation, and advantageously, secretion of 
the gene product may be used as host cells for the expression of HIC-1 Mammalian 
cell lines may be preferable. Such host cell lines may include but are not limited to 
15 CHO, VERO, BHK, HeLa, COS, MDCK, -293, and WI38 

N4ammalian cell systems which utilize recombinant viruses or viral elements to direct 
expression may be engineered. For example, when using adenovirus expression 
vectors, the HIC-1 coding sequence may be ligated to an adenovirus transcription/- 
translation control complex, e g , the late promoter and tripartite leader sequence 

20 This chimeric gene may then be inserted in the adenovirus genome by in vitro or in 
vivo recombination. Insertion in a non-essential region of the viral genome (e.g., 
region El or E3) will result in a recombinant virus that is viable and capable of 
expressing the protein in infected hosts {e,g., see Logan & Shenk, Proc, NatL Acad, 
Sci, USA, 81:3655-3659, 1984), Alternatively, the vaccinia virus 7.5K promoter may 

25 be used (e.g., see, Mackett, eiai, 1982, Proc. NatL Acad. Sci. USA 79:7415-7419, 
Mackett, etai^J, Virol, 49:857-864, 1984;Panicali, etaL, Proc. NatL Acad. ScL USA 
79:4927-4931, 1982). Of particular interest are vectors based on bovine papilloma 
vims which have the ability to replicate as extrachromosomal elements (Sarver, et al., 
MoL CelL BioL \ \ 486, 1981). Shortly after entry of this DNA into mouse cells, the 

30 plasmid replicates to about 100 to 200 copies per cell Transcription of the inserted 



wo 96/14877 



-17- 



PCTAJS95/14996 



cDNA does not require integration of the plasmid into the host's chromosome, thereby 
yielding a high level of expression. These vectors can be used for stable expression 
by including a selectable marker in the plasmid, such as, for example, the neo gene. 
Alternatively, the retroviral genome can be modified for use as a vector capable of 
5 introducing and directing the expression of the HIC-1 gene in host cells (Cone & 
Mulligan, Proc. Natl. Acad. Sci. USA 81:6349-6353, 1984) High level expression 
may also be achieved using inducible promoters, including, but not Umited to, the 
metallothionine HA promoter and heat shock promoters. 

For long-term, high-yield production of recombinant proteins, suble expression is 
10 preferred Rather than using expression vectors which contain viral origins of 
replication, host cells can be transformed with the HIC-1 cDNA controlled by 
appropriate expression control elements {e.g.. promoter, enhancer, sequences, 
transcription terminators, polyadenylation sites, etc.), and a selectable marker. The 
selectable marker in the recombinant plasmid confers resistance to the selection and 
1 5 allows cells to stably integrate the plasmid into their chromosomes and grow to form 
foci which in turn can be cloned and expanded into cell lines. For example, following 
the introduction of foreign DNA, engineered cells may be allowed to grow for 1-2 
days in an enriched media, and then are switched to a selective media. A number of 
selection systems may be used, including but not limited to the herpes simplex virus 
20 thymidine kinase (Wigler, et ai. Cell, 11:223, 1977), hypoxanthine-guanine 
phosphoribosyltransferase (Szybalska & Szybalski, Proc. Natl. Acad. Sci. USA, 
48 2026, 1962), and adenine phosphoribosyltransferase (Lowy, et al.. Cell. 22: 817, 
1980) genes can be employed in tk', hgprt or aprt cells respectively. Also, 
antimetabolite resistance can be used as the basis of selection for dhfi-, which confers 
25 resistance to methotrexate (Wigler, et al.. Natl. Acad Sci. USA, Z7.3567, 1980, 
O'Hare. et al., Proc. Natl. Acad Sci. USA . 78: 1527, 1981); gpt, which confers 
resistance to mycophenolic acid (Mulligan & Berg, Proc. Natl. Acad Sci. USA. 78: 
2072, 1981; neo, which confers resistance to the aminoglycoside G-418 (Colberre- 
Garapin, et al.. J. Mol. Biol., HQ l, 1981); and hygro. which confers resistance to 
30 hygromycin (Santerre, et al.. Gene. 30:147. 1984) genes Recently, additional 
selectable genes have been described, namely trpB. which allows cells to utilize 
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indole in place of tryptophan; hisD, which allows cells to utilize histinol in place of 
histidine (Hartman & Mulligan, Proc. Natl. Acad ScL USA, 85 8047, 1988); and ODC 
(ornithine decarboxylase) which confers resistance to the ornithine decarboxylase 
inhibitor, 2-(difluoromethyl)-DL-omithine, DFMO (McConlogue L., 1987, In: 
Current Comnmmcations in Molecular Biology, Cold Spring Harbor Laboratory, ed.) 

Transformation of a host cell with recombinant DNA may be carried out by 
conventional techniques as are well known to those skilled in the art. Where the host 
is prokaryotic, such as E, coli, competent cells which are capable of DNA uptake can 
be prepared from cells harvested after exponential growth phase and subsequently 
treated by the CaCl2 method using procedures well known in the art. Alternatively, 
MgClj or RbCl can be used. Transformation can also be performed after forming a 
protoplast of the host cell if desired. 

When the host is a eukaryote, such methods of transfection of DNA as calcium 
phosphate co-precipitates, conventional mechanical procedures such as 
microinjection, electroporation, insertion of a plasmid encased in liposomes, or virus 
vectors may be used Eukaryotic cells can also be cotransformed with DNA sequenc- 
es encoding the HIC-1 of the invention, and a second foreign DNA molecule encoding 
a selectable phenotype, such as the herpes simplex thymidine kinase gene Another 
method is to use a eukaryotic viral vector, such as simian virus 40 (SV40) or bovine 
papilloma virus, to transiently infect or transform eukaryotic cells and express the 
protein (see for example, Eukaryotic Viral Vectors, Cold Spring Harbor Laboratory, 
Gluzman, ed., 1982). 

Isolation and purification of microbial or host cell expressed polypeptide, or 
fragments thereof, provided by the invention, may be carried out by conventional 
means including preparative chromatography and affinity and immunological 
separations involving monoclonal or polyclonal antibodies 
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The invention includes antibodies immunoreactive with HIC-1 polypeptide (SEQ ED 
NO: 3) or immunoreactive fragments thereof. Antibody which consists essentially of 
pooled monoclonal antibodies with different epitopic specificities, as well as distinct 
monoclonal antibody preparations are provided. Monoclonal antibodies are made 
5 from antigen containing fragments of the protein by methods well known to those 
skilled in the art (Kohler, et ai, Nature, 256:495, 1975). The term antibody as used 
in this invention is meant to include intact molecules as well as fragments thereof, 
such as Fab and F(ab')2, which are capable of binding an epitopic determinant on HIC- 
1. 

10 The invention also provides a method for detecting a cell proliferative disorder 
associated with HIC-1 in a subject, comprising contacting a target cellular component 
suspected of having a HIC-1 associated disorder, with a reagent which reacts with or 
binds to HIC-1 and detecting HIC-1 . The target cell component can be nucleic acid, 
such as DNA or RNA, or it can be protein. When the component is nucleic acid, the 

1 5 reagent is typically a nucleic acid probe or PCR primer. When the cell component is 
protein, the reagent is typically an antibody probe. The target cell component may be 
detected directly in situ or it may be isolated from other cell components by conunon 
methods known to those of skill in the art before contacting with a probe (See for 
example, Maniatis, et ai, Molecular Cloning, A Laboratory Manual, Cold Spring 

20 Harbor Laboratory, N. Y, 1989, Current Protocols in Molecular Biology, 1994, Ed 
Ausubel, et ai. Greene Publ. Assoc. & Wiley Interscience.) Detection methods 
include Southern and Northern blot analyses, RNase protection, immunoassays and 
other detection assays that are known to those of skill in the art. 



The probes can be detectably labeled, for example, with a radioisotope, a fluorescent 
25 compound, a bioluminescent compound, a chemiluminescent compound, a metal 
chelator, or an enzyme Those of ordinary skill in the art wUl know of other suitable 
labels for binding to the probes or will be able to ascertain such, using routine 
experimentation. 
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Since the present invention shows that a decreased level of HIC-1 transcription is 
often the result of hypermethylation of the HIC-1 gene, it is often desirable to directly 
determine whether the HIC-1 gene is hypermethylated. In particular, the cytosine rich 
areas terms "CpG islands" which lie in the 5* regulatory regions of genes are normally 
5 unmethylated The term "hypermethylation" includes any methylation of cytosine 
which is normally unmethylated in the HIC-1 gene sequence can be detected by 
restriction endonuclease treatment of HIC-1 polynucleotide (gene) and Southern blot 
analysis for example Therefore, in a method of the invention, when the cellular 
component detected is DNA, restriction endonuclease analysis is preferable to detect 

1 0 hypermethylation of the HIC-1 gene. Any restriction endonuclease that includes CG 
as part of its recognition site and that is inhibited when the C is methylated, can be 
utilized Methylation sensitive restriction endonucleases such as BssHll, Mspl, Noil 
or Hpall, used alone or in combination are examples of such endonucleases. Other 
methylation sensitive restriction endonucleases will be knovm to those of skill in the 

15 art . In addition, PCR can be utilized to detect the methylation status of the HIC-1 
gene Oligonucleotide primers based on any coding sequence region in the HIC-1 
sequence are useful for amplyifying DNA by PCR. 

For purposes of the invention, an antibody or nucleic acid probe specific for HIC- 1 
may be used to detect the presence of HIC-1 polypeptide (using antibody) or 

20 polynucleotide (using nucleic acid probe) in biological fluids or tissues 
Oligonucleotide primers based on any coding sequence region in the HIC-1 sequence 
are useful for amplifying DNA, for example by PCR Any specimen containing a 
detectable amount of HIC-1 polynucleotide or HIC-1 polypeptide antigen can be used 
Nucleic acid can also be analyzed by RNA in situ methods which are known to those 

25 of skill in the art, A preferred sample of this invention is tissue of heart, renal, brain, 
colon, breast, urogenital, uterine, hematopoietic, prostate, thymus, lung, testis, and 
ovarian Preferably the subject is human 

Various disorders which are detectable by the method of the invention include 
astrocytoma, anaplastic astrocytoma, glioblastoma, meduUoblastoma, colon cancer. 
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lung cancer, renal cancer, leukemia, breast cancer, prostate cancer, endometrial cancer 
and neuroblastoma. 

Monoclonal antibodies used in the method of the invention are suited for use, for 
example, in immunoassays in which they can be utilized in liquid phase or bound to 

5 a solid phase carrier. In addition, the monoclonal antibodies in these immunoassays 
can be detectably labeled in various ways. Examples of types of immunoassays which 
can utiUze monoclonal antibodies of the invention are competitive and non- 
competitive immunoassays in either a direct or indirect format. Examples of such 
immunoassays are the radioimmunoassay (RIA) and the sandwich (immunometric) 

10 assay Detection of the antigens using the monoclonal antibodies of the invention can 
be done utilizing immunoassays which are run in either the forward, reverse, or 
simultaneous modes, including immunohistochemical assays on physiological 
samples. Those of skill in the art will know, or can readily discern, other 
immunoassay formats without undue experimentation. 

1 5 The term "immunometric assay" or "sandwich immunoassay", includes simultaneous 
sandwich, forward sandwich and reverse sandwich immunoassays. These terms are 
well understood by those skilled in the art. Those of skill will also appreciate that 
antibodies according to the present invention will be useful in other variations and 
forms of assays which are presently known or which may be developed in the future. 

20 These are intended to be included within the scope of the present invention. 

Monoclonal antibodies can be bound to many difFerem carriers and used to detect the 
presence of HIC-1. Examples of well-known carriers include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, natural and modified 
ceUuloses, polyacrylamides. agaroses and magnetite. The nature of the carrier can be 
25 either soluble or insoluble for purposes of the invention Those skilled in the art will 
know of other suitable carriers for binding monoclonal antibodies, or will be able to 
ascertain such using routine experimentation. 
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In performing the assays it may be desirable to include certain "blockers" in the 
incubation medium (usually added with the labeled soluble antibody). The "blockers" 
are added to assure that non-specific proteins, proteases, or anti-heterophilic immuno- 
globulins to anti-HIC-1 immunoglobulins present in the experimemal sample do not 
cross-link or destroy the antibodies on the solid phase support, or the radiolabeled 
indicator antibody, to yield false positive or false negative results. The selection of 
"blockers" therefore may add substantially to the specificity of the assays described 
in the present invention. 

It has been found that a number of nonrelevant (i.e., nonspecific) antibodies of the 
same class or subclass (isotype) as those used in the assays (e.g.. IgGl, IgG2a, IgM, 
etc.) can be used as "blockers" The concentration of the "blockers" (normally 1-100 
Mg/Ml) may be important, in order to maintain the proper sensitivity yet inhibit any 
unwanted imerference by mutually occurring cross reactive proteins in the specimen. 

In using a monoclonal antibody for the in vivo detection of antigen, the detectably 
1 5 labeled monoclonal antibody is given in a dose which is diagnostically eflFective The 
term "diagnostically effective" means that the amoum of detectably labeled 
monoclonal antibody is administered in sufficient quantity to enable detection of the 
site having the HIC-1 antigen for which the monoclonal antibodies are specific The 
concentration of detectably labeled monoclonal antibody which is administered 
should be sufficient such that the binding to those cells having HIC-1 is detectable 
compared to the background. Further, it is desirable that the detectably labeled 
monoclonal antibody be rapidly cleared fi-om the circulatory system in order to give 
the best target-to-background signal ratio. 



20 



25 



As a nile. the dosage of detectably labeled monoclonal antibody for in vivo diagnosis 
will vary depending on such factors as age, sex, and extent of disease of the individu- 
al. The dosage of monoclonal antibody can vary fi-om about 0.001 mg/m^ to about 
500 mg/m^ preferably 0.1 mg/m^ to about 200 mg/n? , most preferably about 0.1 
mg/m^ to about 10 mg/A Such dosages may vary, for example, depending on 
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whether multiple injections are given, tumor burden, and other factors known to those 
of skill in the art. 

For in vivo diagnostic imaging, the type of detection instrument available is a major 
factor in selecting a given radioisotope. The radioisotope chosen must have a type of 

5 decay which is detectable for a given type of instrumem. Still another important 
factor in selecting a radioisotope for in vivo diagnosis is that the half-life of the 
radioisotope be long enough so that it is still detectable at the time of maximum 
uptake by the target, but short enough so that deleterious radiation with respect to the 
host is minimized. Ideally, a radioisotope used for in vivo imaging will lack a particle 

10 emission, but produce a large number of photons in the 140-250 keV range, which 
may be readily detected by conventional gamma cameras. 

For in vivo diagnosis, radioisotopes may be bound to immunoglobulin either directly 
or indirectly by using an intermediate functional group. Intermediate functional 
groups which often are used to bind radioisotopes which exist as metallic ions to 
15 immunoglobulins are the bifiinctional chelating agems such as 
diethylenetriaminepemacetic acid (DTPA) and ethylenediaminetetraacetic acid 
(EDTA) and similar molecules. Typical examples of metallic ions which can be 
bound to the monoclonal antibodies of the invention are ^"In, ''Ru, *^Ga, "Ga, "As, 
■"Zr, and ^"^'Tl. 

20 A monoclonal antibody useful in the method of the invention can also be labeled with 
a paramagnetic isotope for purposes of m vivo diagnosis, as in magnetic resonance 
imaging (MRI) or electron spin resonance (ESR). In general, any conventional 
method for visualizing diagnostic imaging can be utilized. Usually gamma and 
positron emitting radioisotopes are used for camera imaging and paramagnetic 

25 isotopes for MRI. Elements which are particularly useful in such techniques include 
'"Gd. "Mn, '"Dy, "Cr, and "Fe. 



wo 96/14877 



PCTAJS95/14996 



-24- 



10 



15 



The present invention also provides a method for treating a subject with a cell 
proliferative disorder associated with of HIC-1 comprising administering to a subject 
with the disorder a therapeutically effective amount of reagent which modulates HIC- 
1 expression. In brain, breast and renal cancer cells, for example, the HIC-1 
nucleotide sequence is under-expressed as compared to expression in a normal cell, 
therefore, it is possible to design appropriate therapeutic or diagnostic techniques 
directed to this sequence. Thus, where a cell-proUferative disorder is associated with 
the expression of HIC-1 associated with malignancy, nucleic acid sequences that 
modulate HIC-1 expression at the transcriptional or translational level can be used. 
In cases when a cell proliferative disorder or abnormal cell phenotype is associated 
with the under expression of HIC-1. for example, nucleic acid sequences encoding 
mc-l (sense) could be administered to the subject with the disorder. 

The teim "ceU-proliferative disorder" denotes malignant as well as non-maUgnant cell 
populations which often appear to differ from the surrounding tissue both 
morphologically and genotypically. Such disorders may be associated, for example, 
with absence of expression of HIC-1 . Essentially, any disorder which is etiologically 
linked to expression of HIC-1 could be considered susceptible to treatment with a 
reagent of the invention which modulates HIC-1 expression 

The term "modulate" envisions the suppression of methylation of HIC-1 
20 polynucleotide when HIC- 1 is under-expressed. When a cell proliferative disorder is 
associated with HIC-1 expression, such methylation suppressive reagents as 5- 
azacytadine can be introduced to a cell. Alternatively, when a cell proliferative 
disorder is associated with under-expression of HIC-1 polypeptide, a sense 
polynucleotide sequence (the DNA coding strand) encoding HIC-1 polypeptide, or 5* 
regulatory nucleotide sequences (i.e., promoter) of HIC-1 in operable linkage with 
HIC-1 polynucleotide can be introduced into the cell. Demethylases known in the art 
could also be used to remove methylation. 

The present invention also provides gene therapy for the treatment of cell proliferative 
disorders which are mediated by HIC- 1 Such therapy would achieve its therapeutic 



25 
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effect by introduction of the appropriate HIC-1 polynucleotide which contains a HIC- 
1 structural gene (sense), into cells of subjects having the proliferative disorder. 
Delivery of sense HIC-1 polynucleotide constructs can be achieved using a 
recombinant expression vector such as a chimeric virus or a colloidal dispersion 
5 system. 

The polynucleotide sequences used in the method of the invention may be the native, 
unmethylated sequence or, alternatively, may be a sequence in which a 
nonmethylatable analog is substituted within the sequence. Preferably, the analog is 
a nonmethylatable analog of cytidine, such as 5-azacytadine. Other analogs will be 
1 0 known to those of skill in the art. Alternatively, such nonmethylatable analogs could 
be administered to a subject as drug therapy, alone or simultaneously with a sense 
structural gene for HIC-1 or sense promoter for HIC-1 operably linked to HIC-1 
structural gene. 

In another embodiment, a HIC-1 structural gene is operably linked to a tissue specific 
1 5 heterologous promoter and used for gene therapy For example, a HIC- 1 gene can be 
Ugated to prostate specific antigen (PSA) - prostate specific promoter for expression 
of HIC-1 in prostate tissue. Other tissue specific promoters will be known to those 
of skill in the art. Alternatively, the promoter for another tumor suppressor gene can 
be linked to the HIC-1 structural gene and used for gene therapy. 

20 Various viral vectors which can be utilized for gene therapy as taught herein include 
adenovirus, herpes virus, vaccinia, or, preferably, an RNA virus such as a retrovirus. 
Preferably, the retroviral vector is a derivative of a murine or avian retrovirus 
Examples of retroviral vectors in which a single foreign gene can be inserted include, 
but are not limited to: Moloney murine leukemia virus (MoMuLV), Harvey murine 

25 sarcoma virus (HaMuSV), murine mammary tumor virus (MuMTV), and Rous 
Sarcoma Virus (RSV). Most preferably, a non-human primate retroviral vector is 
employed, such as the gibbon ape leukemia virus (GaLV). thereby providing a 
broader host range than murine vectors, for example 
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A number of additional retroviral vectors can incorporate multiple genes. All of these 
vectors can transfer or incorporate a gene for a selectable marker so that transduced 
cells can be identified and generated. Retroviral vectors can be made target specific 
by inserting, for example, a polynucleotide encoding a sugar, a glycolipid, or a 
protein. Preferred targeting is accomplished by using an antibody to target the 
retroviral vector. Those of skill in the art will know of, or can readily ascertain 
without undue experimentation, specific polynucleotide sequences which can be 
inserted into the retroviral genome to allow target specific delivery of the retroviral 
vector containing the HIC-1 sense or antisense polynucleotide. 

Since recombinant retroviruses are defective, they require assistance in order to 
produce infectious vector particles. This assistance can be provided, for example, by 
using helper cell lines that contain plasmids encoding all of the structural genes of the 
retrovirus under the control of regulatory sequences within the LTR. These plasmids 
are missing a nucleotide sequence which enables the packaging mechanism to 
recognize an RNA transcript for encapsidation. Helper cell lines which have deletions 
of the packaging signal include but are not limited to T2, PA317 and PA12, for 
example These cell lines produce empty virions, since no genome is packaged. If 
a retroviral vector is introduced into such cells in which the packaging signal is intact, 
but the structural genes are replaced by other genes of interest, the vector can be 
packaged and vector virion produced. 

Another targeted delivery system for HIC-1 polynucleotide is a colloidal dispersion 
system. Colloidal dispersion systems include macromolecule complexes, 
nanocapsules, microspheres, beads, and lipid-based systems including oil-in-water 
emulsions, micelles, mixed micelles, and liposomes. The preferred colloidal system 
of this invention is a liposome. Liposomes are artificial membrane vesicles which are 
useful as delivery vehicles in vitro and in vivo. It has been shown that large 
unilamellar vesicles (LUV), which range in size from 0 2-4 0 um can encapsulate a 
substantial percentage of an aqueous buffer containing large macromolecules. RNA, 
DNA and intact virions can be encapsulated within the aqueous interior and be 
delivered to cells in a biologically active form (Fraley, et al. Trends Biochem. Sci , 



10 



15 



20 



wo 96/14877 



-27- 



PCTAJS95/14996 



6:77, 1981). In addition to mammalian cells, liposomes have been used for delivery 
of polynucleotides in plant, yeast and bacterial ceUs. In order for a liposome to be an 
efficient gene transfer vehicle, the foUowing characteristics should be present: (1) 
encapsulation of the genes of interest at high efficiency while not compromising their 
5 biological activity; (2) preferential and substantial binding to a target cell in 
comparison to non-target cells; (3) delivery of the aqueous contents of the vesicle to 
the target cell cytoplasm at high efficiency; and (4) accurate and effective expression 
of genetic information (Mannino, etal, Biotechniques, 6:682, 1988). 

The composition of the liposome is usually a combination of phospholipids, 
1 0 particularly high-phase-transition-temperature phospholipids, usually in combination 
with steroids, especially cholesterol. Other phospholipids or other lipids may also be 
used. The physical characteristics of liposomes depend on pH, ionic strength, and the 
presence of divalent cations. 

Examples of lipids useful in liposome production include phosphatidyl compounds, 
15 such as phosphatidylglycerol, phosphatidylcholine. phosphatidylserine, 
phosphatidylethanolamine, sphingolipids, cerebrosides, and gangliosides Particularly 
useful are diacylphosphatidylglycerols, where the lipid moiety contains from 14-18 
carbon atoms, particularly from 16-18 carbon atoms, and is saturated. Illustrative 
phospholipids include egg phosphatidylcholine, dipalmitoylphosphatidylcholine and 
20 distearoylphosphatidylcholine. 
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The targeting of liposomes has been classified based on anatomical and mechanistic 
factors. Anatomical classification is based on the level of selectivity, for example, 
organ-specific, cell-specific, and organelle-specific Mechanistic targeting can be 
distinguished based upon whether it is passive or active Passive targeting utilizes the 
natural tendency of liposomes to distribute to cells of the reticulo-endothelial system 
(RES) in organs v^hich contain sinusoidal capillaries Active targeting, on the other 
hand, involves alteration of the liposome by coupling the liposome to a specific Ugand 
such as a monoclonal antibody, sugar, glycolipid, or protein, or by changing the 
composition or size of the liposome in order to achieve targeting to organs and cell 
types other than the naturally occurring sites of localization 

The surface of the targeted delivery system may be modified in a variety of ways In 
the case of a liposomal targeted delivery system, lipid groups can be incorporated into 
the lipid bilayer of the liposome in order to maintain the targeting ligand in stable 
association with the liposomal bilayer Various linking groups can be used for joining 
the lipid chains to the targeting ligand 

In general, the compounds bound to the surface of the targeted delivery system will 
be ligands and receptors which will allow the targeted delivery system to find and 
"home in" on the desired cells A ligand may be any compound of interest which will 
bind to another compound, such as a receptor. 

In general, surface membrane proteins which bind to specific effector molecules are 
referred to as receptors. In the present invention, antibodies are preferred receptors 
Antibodies can be used to target liposomes to specific cell-surface ligands. For 
example, certain antigens expressed specifically on tumor cells, referred to as tumor- 
associated antigens (TAAs), may be exploited for the purpose of targeting HIC-1 
antibody-containing liposomes directly to the malignant tumor Since the HIC-1 gene 
product may be indiscriminate with respect to cell type in its action, a targeted 
delivery system oflfers a significant improvement over randomly injecting non-specific 
liposomes. Preferably, the target tissue is human brain, colon, breast, lung, and renal 
origin A number of procedures can be used to covalently attach either polyclonal or 
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monoclonal antibodies to a liposome bilayer. Antibody-targeted liposomes can 
include monoclonal or polyclonal antibodies or fragments thereof such as Fab, or 
F(ab')2. as long as they bind efficiently to an antigenic epitope on the target cells 
Liposomes may also be targeted to cells expressing receptors for hormones or other 
5 serum factors. 

For use in the diagnostic research and therapeutic applications suggested above, kits 
are also provided by the invention Such a kit may comprise a carrier means being 
compartmentalized to receive in close confinement one or more container means such 
as vials, tubes, and the like, each of the container means comprising one of the 
1 0 separate elements to be used in the method. 

For example, one of the container means may comprise a probe which is or can be 
detectably labelled. Such probe may be an antibody or nucleotide specific for a target 
protein or a target nucleic acid, respectively, wherein the target is indicative, or 
correlates with, the presence of HIC-l of the invention. Where the kit utilizes nucleic 
1 5 acid hybridization to detect the target nucleic acid, the kit may also have containers 
containing nucleotide(s) for amplification of the target nucleic acid sequence and/or 
a container comprising a reporter-means, such as a biotin-binding protein, such as 
avidin or streptavidin, bound to a reporter molecule, such as an enzymatic, florescent, 
or radionucleotide label. 

20 The invention also provides a method for identifying a tumor suppressor gene by 
detecting abnormal nucleic acid methylation, in particular, detecting CpG island 
hypermethylation in the regions of frequent allelic loss. The present invention has 
shown that aberrant methylation of normally unmethylated CpG islands can fiinction 
as a "mutation" to silence tumor suppressor gene transcription during tumor 

25 progression. The occurrence of the 1 7p 1 3 3 hypermethylation appears to correlate 
with both the timing and incidence of these allelic losses in the progression of brain, 
colon, and renal cancers. It is shown by the present invention that this CpG island 
harbors a tumor suppressor fflC-1 gene which is silenced by abnormal methylation. 
In other words, identification of such CpG islands has constituted an important 



wo 96/14877 



PCT/US95/14OT6 



-30. 

strategy for isolation of the new tumor suppressor HIC- 1 gene Therefore, the finding 
of this abnormality in chromosome areas which frequently undergo the tumor 
associated allelic losses that broadly define candidate tumor suppressor regions could 
facilitate the localization of the responsible genes The common methods used for 
5 detecting abnormal nucleic acid methylation are well known in the art and those 
skilled in the art should be able to use one of the methods accordingly for the purpose 
of practicing the present invention 

The following Examples are intended to illustrate, but not to limit the invention 
While such Examples are typical of those that might be used, other procedures known 
10 to those skilled in the art may alternatively be utilized. 

EXAMPLES 

HIC- 1 expression is ubiquitous in normal adult tissues. However, in cultured tumor 
cells and in primary cancers which exhibit hypermethylation of the associated CpG 
island, HIC-1 expression is reduced or absent For example, the expression of HIC- 1 
15 is absent in tumors with CpG island hypermethylation, including lung, colon, breast 
and brain tumors This expression pattern is consistent with a tumor suppressor gene 
fijnction for HIC-1. 

EXAMPLE 1 
MATERIALS AND METHODS 

20 L Subcloning of cosmid DNA 

Subclones of cosmid CI 3 A DNA (FIGURE lA) were prepared by isolation of 
muhiple restriction fragments on agarose gels and ligation of these into pBluescript 
plasmid (Stratagene). 

2. DNA sequencing 

25 Single stranded DNA was first isolated by growing plasmid DNA in 2xYT broth with 
75ug/ml ampiciUin and in the presence of lO'-lO' pfu/ml of VCSM13 (Stratagene) 
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(helper phage) for 2 hrs After isolation, the DNA was sequenced using the GIBCO 
BRL cycle sequencing kit. Generally, 22 base pair primers were end labeled with y- 
"P and cycle conditions were 95X for 1 cycle followed by 20 cycles of 95»C for 10 
sec. and 65°C for 10 sec Reaction products were analyzed on 10% acrylamide/8 M 
5 urea gels. 

3. Southern and Northern hybridizations 

Isolation procedures for DNA and poly A+ RNA, agarose gel running conditions, a- 
"P labelUng of probes, filter hybridization and wash conditions are as previously 
described (Baylin, S.B., et ai. Cancer Cells, 3;383-390. 1991; Jones, P.A.. et ai, 

10 Cancer Res.. 54: 1-23, 1990; Herman, J.G., et ai. Proc. Nafl Acad. Sci.. in press, 
1994, Ottaviano, Y L., et al.. Cancer Res.. 54:2552-2555, 1994, Issa, J-P., et ai. 
Nature Genetics, in press; Steenman, M.J.C., et al.. Nature Genetics. 2:433-439, 1994, 
and Gish, W., etai. Nature Genetics, 3 .266-272, 1993). Radioautograms were either 
exposed at -TO'C for various times or in a phosphoimager casette. followed by 

15 exposure and analysis in the phosphoimager Image Quant program (Molecular 
Dynamics) Preparation of single strand, a-"P-labeled RNA probes for use in some 
Northern hybridizations was accomplished by in vitro transcription, using or 
polymerase, of DNA inserts in the various cosmid sublcones shown in FIGURE 1 A 

4. RNAse protection assays 

20 Preparation of a-"P-labeled RNA probes from the various cosmid subclones 
(HGURE 1 A), Uquid hybridization to RNA samples, and post-hybridization digestion 
by RNAse were all performed with the Ambion MAXIscript and RPAII kits according 
to the manufacturer's specifications. In general, 8x10* cpm of probe was hybridized 
to 10 ng of total RNA for 12-15 h at 45°C. Products of RNAse digestion were 

25 analyzed on a 6% acrylamide/8 M urea gel. Lengths of hybridization probes were 
determined by positions of various restriction cuts of the plasmid insert DNA For 
assessmem of RNA loading, a 250 bp GAPDH probe was prepared by Hinc II 
restriction and co-hybridized with RNA in all reactions. 



5. Exon trapping 
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Exon trapping was performed with subclone 26 (FIGURE 1 A) using the GIBCO BRL 
Exon Trapping System, as per manufacturer's protocol. 

6. Cell cultures and tissue specimens 

Normal human fibroblast lines WI-38 and IMR-90 and colon cancer line. CaCOj, 
5 were obtained from the American Tissue Culture Collection (ATCC, Rockville, MD). 
The NCI-H209 line of human small cell lung carcinoma has been previously 
described (Carney, D N., et ai. Recent Results Cancer Res. , 92: 1 57-166, 1985) All 
established breast cancer lines were utilized, as detailed in FIGURE 5, in a recent 
study (Herman, J G., et ai. Proc. Nat'l. Acad Sci.. 919700-9704, 1994) and were 

1 0 kindly provided by Dr. Nancy Davidson. A cell ftjsion system of tumor progression 
consisting of normal donor fibroblast line GM229 and the HT1080 line of 
fibrosarcoma cells, plus their fusion products, SFTH 300 and SFTH 300 TRl, were 
a gift from Dr. B Weismann. AJl samples of fresh, non-cultured, normal and 
neoplastic human tissues were those obtained as described (Herman, J G; et al. supra, 

1 5 Ottaviano, Y.L., et ai. supra; Issa, J-P., et al.. supra, Steenman, M. J.C., et ai. supra, 
and Gish, V^ ,etaL, supra). 

EXAMPLE 2 

roENTTFICATION OF NEW TUMOR SUPPRKSSOR r.ny F 

20 To characterize the region encompassing the aberrantly methylated CpG island, a 
series of subclones were prepared (FIGURE lA) from the 17p cosmid C-13A 
(Ledbetter, D.H., etai. Proc. Natl Acad. Sci. USA. 56 5 136, 1989; EI-Deiiy, W S., 
et al.. Nature Genetics, 1:45-49, 1992; Kern, S.E., et al.. Science, 252:1708. 1991; 
Funk, W.D., etal.. Moi & Cell. Biol, 12:2866. 1992) previously shown to contain the 

25 cluster of methylation sensitive Not I sites hypermethylated in tumors. Using these 
as probes for "zoo blots", three regions (HGUREl A: plasmids CI, CII, and 400) were 
found which hybridized, under stringent conditions, to restriction fragments in bovine 
and murine DNA. Traditional positional cloning approaches were impeded by high 
non-specific hybridization of these probes to human DNA and cDNA libraries. 

30 probably due to the high GC content of the area Therefore, most of the 1 1 kb region 
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(FIGURE 1 A) was sequenced and analyzed by the Grail computer program (Gish, W , 
et ai, D.J., Nature Genetics, 3:266, 1993). 

FIGURE lA is a diagram showing a map of an 11 0 kb region of cosmid C-13A 
which contains a 50 kb human DNA insert harboring the region of chromosome 
5 17pl3.3 previously shown to have hypermethylation in multiple human tumor types 
(Makos, M , etai, Proc. Natl. Acad ScL USA, 89:1929, 1992; Makos, M., et ai^ 
Cancer Res., 53:2715, 1993; Makos, M., et ai. Cancer Res 53:2719, 1993) The 
position of the YNZ22 probe, EcoRI (E) restriction site and the location of a series of 
cosmid subclones which were prepared to span the area are shown 

10 FIGURE IB is a schematic for the HIC-1 gene which was found to be encompassed 
within the region shown in FIGURE 1 A and for which the amino acid sequence is 
shown in FIGURE 2B. Shown are: potential p53 binding site, TATAA = the TATA 
box sequence 40 bp upstream from the transcription start site; 5' UTR = the 1st 
untranslated exon; ATG = the most 5* translation start site; ZIN (zinc finger N- 

1 5 terminus) = the 478bp exon encompassing the highly conserved region (FIGURE 2A) 
of the Zin domain subfamily of zinc finger transcription factors; rectangle with shaded 
bars represents the 2015 bp last exon of HIC-1 and each shaded bar represents one of 
the 5 zinc fingers (FIGURE 23) clustered in this 3* region of the gene, TAG = 
translation stop site in the HIC-1 gene; AATAAA = polyadenylation signal site found 

20 835 bp from the translation stop site. FIGURE IC shows the nucleotide and deduced 
amino acid sequence of HIC-1 . 

Two independent regions of excellent coding potential were revealed between the N3 
to N7 Not I restriction sites (FIGURE 1 A). Blast program (Altschul, S.F., et ai, J. 
Mol. Biol., 215:403, 1990) analysis revealed distinct amino acid homologies 

25 (FIGURES IB and 2 A), within one of the independent regions, to a highly conserved 
N-terminal motif, termed the Zin (zinc finger N-terminal) domain, which is present 
in each member of a recently defined subset of zinc finger transcription factors 
(Harrison and Travers,£WBO J 9:207, 1990;diBeUo, etaL, Genetics. 129:385, 1991, 
Numoto, etaL, Nucleic Acids Res 21 3767, 1993; Chardin, et aL. Nucleic Acids Res. 

30 19:1431, 1991). In addition to the Zin domain, five Kruppel type CyS2-His2 zinc 
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fingers (Ruppert, J.M., etaL Mol & Cell Biol, 8:3104.31 13, 1988) characteristic of 
the 3' region of these same proteins, were also identified (FIGURES IB and 2B). This 
novel gene was named HIC-1 (hypermethylated in cancer). 

EXAMPLE 3 

5 CHARACTERIZATION OF HIOl 

A combination of RNAse protection strategies, exon trapping studies, and Northern 
blot analyses, were utilized to characterize expression of HIC-1 and to define the 
genomic structure of the gene (FIGURES IB and IC; SEQ ID NO: 1 and 2) The start 
of transcription was identified within 40 bp downstream fi-om a TATA box sequence 

1 0 (FIGURE IB) which precedes an untranslated first exon. The putative ATG site and 
the TXti domain are located in a 476 bp second exon and are in a similar position to 
those of the 8 other Zin domain proteins (FIGURE 2 A) The 5 zinc fingers 
(FIGURES IB and 2B) reside in a 2015 bp final exon, containing a translation stop 
site 835 bp upstream from the polyadenylation signal, AATAAA. The HIC-1 gene 

15 (FIGURES IC and 2B ), structured similarly to the other Zin domain proteins, is 
encompassed by three exons v/ithin the CpG rich 3 0 kb region between Not I sites N3 
and N7 (FIGURE 1) 

HGURE 2A and SEQ ID NO:2 show the amino acid sequences of HIC- 1 The HIC- 1 
amino acid sequence is compared with the conserved N-terminus region of the other 

20 members of the Zin domain zinc finger family. In the parentheses, the numbers 
indicate the position of the conserved region relative to the translation start site of 
each gene The darkest shading shows position of amino acids which are identical for 
at least five of the 9 proteins and the lighter shading shows position of conservative 
amino acid differences between the family members. D - drosophila; M = murine, 

25 H = human. The bracket of amino acids at the bottom represents an area in HIC- 1 not 
found at this position in the other family members. 

FIGURE 2B and SEQ ID NO:3 show the entire coding region of the HIC-1 gene. The 
deduced amino acid sequence for the two coding exons of HIC-1, as defined by the 
sequence analyses and expression strategies outlined in the text, are shown. The 5 
30 zinc fingers in the 3' half of the protein are shown by the shaded boxes. 
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FXAMPLE 4 
ANALYSTS OF HIC-l GENF EXPRESSION 

mC-l was found to be ubiquitously expressed gene. By Northern analysis of poly A+ 
RNA from multiple normal tissues, probes from the HIC-1 Zin domain, zinc finger 
5 regions, and 3' untranslated regions inclusive of the polyadenylation site, all identified 
the same predominant 3.0 kb transcript. FIGURE 3 shows a Northern analyses of 
HIC-1 gene expression S = spleen. The = thymus; P= prostate; Te = testis; 0 = 
ovary; SI = small intestine; B = peripheral blood cells. The band above the 4 4 kb 
marker co-hybridizes with ribosomal RNA. The -1.1 kb band has not yet been 
10 identified but could be an alternate splice product since it was not detected with 
probes from the zinc finger or 3* untranslated regions of HIC- 1 . 

FIGURE 4 A shows RNAse protection assays of HIC-1 gene expression in a variety 
of normal and neoplastic human tissues In all panels, the top asterisk marks the 
position of the undigested 360bp HIC-1 gene RNA probe which was derived from the 

15 region containing the zinc fingers in cosmid subclone 600 (FIGURE lA) The 
protected HIC-1 fragment (300bp) is labeled HIC-l. FIGURE 4A compares 
expression in 10 ug of total RNA from 2 established culture lines of normal human 
fibroblasts (WI-38 and IMR-90) to the HT 1080 culture line of fibrosarcoma cells 
(Fibro-C), from 3 different samples of normal colon (Colon - N) to the colon 

20 carcinoma cell line, CaCO^ (Colon-C), and from a sample of normal lung (Lung-N) 
to the established line of human small cell lung carcinoma, NCI-H209 (Lung-C). 

FIGURE 4B shows the RNAse protection assay for 10 ug of RNA from 6 different 
established culture lines of breast carcinoma (lane 1 MDA231. lane 2 HS58T; lane 3 
MDA468, lane 4 T47D; lane 5 MCF7; lane 6 MDA453), each of which has extensive 
25 methylation of Not 1 sites of the HIC-1 CpG island. FIGURE 4C shows the RNAse 
protection assay for 10 ug of RNA from normal fetal brain (B) compared to a series 
of non-cultured brain tumors (1 anaplastic astrocytoma (AA) and 8 more advanced 
glioblastomas (lanes 1-8). 
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The 3.0 kb transcript was found in all adult tissues tested with especially high levels 
in lung, colon, prostate, thymus, testis, and ovary (FIGURE 3). With the Zin domain 
probe, a 1. 1 kb transcript was also detected in some tissues which may represent an 
alternatively spliced product (FIGURE 3). RNase protection assays (RPAZ Kit- 
5 Ambion), using a probe from plasmid 600 (FIGURE lA), validated the ubiquitous 
expression of HIC-1, protecting transcripts of predicted size in cultured fibroblasts 
(FIGURE 4A) and non-cultured colon mucosa (FIGURE 4A), lung (FIGURE 4A), 
and brain (FIGURE 4C) 

By RNAse protection assays, HIC-1 expression was found to be absent or decreased 
10 in neoplastic cells which have aberrant HIC-1 CpG island methylation. Little or no 
expression (FIGURE 4A) was detected in cultured cancer cell lines of colon, lung, 
and fibroblast, all previously shown to be fiilly methylated at Not I sites 3 through 7 
The same finding was true for 6 cultured breast cancers (FIGURE 4B), all of which 
exhibited hypermethylation of Not I sites 3 through 7. 

1 5 Furthennore, in primary colon tumors, HIC-1 expression was 2 to 1 7-fold decreased 
in a non-cultured human colon polyp and 3 primary colon tumors, as compared to the 
corresponding normal colon. Finally, the absence of HIC-1 expression in primary, 
non-cultured brain tumors was found in tumors that exhibited aberrant 
hypermethylation of the CpG island An anaplastic astrocytoma which exhibited a 

20 fuU methylation pattern of the HIC- 1 CpG island, did not express this gene (FIGURE 
4C), as compared to normal brain. In 4 glioblastomas, in which both DNA and RNA 
were available, two expressed HIC-1 either weakly (FIGURE 4C, lane 1) or not at all 
(FIGURE 4C, lane 4) and had predominantly hypermethylated alleles, while two with 
unmethylated alleles expressed the gene at levels equal to adjacent normal brain 
25 (FIGURE 4C, lanes 2 and 3). 

Four additional glioblastomas for which RNA was available were also studied. One 
expressed HIC-l weakly (FIGURE 4C, lane 5). one had no expression (FIGURE 4C. 
lane 6), and two tumors expressed this gene (FIGURE 4C, lanes 7-8). 
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In addition, hypermethylation of HIC-1 was analyzed in several primary tumors and 
cultured cell lines by DNA analysis as follows. Southern analyses of DNA from 
control and 24 hour infected cells which was digested with EcoRI (12U/ug DNA) plus 
Not I (20U/ug), were probed with a-''P-labeled YNZ22 (FIGURE lA) exactly as 
5 detailed in previous studies (Makos, et ai, supra, 1992, 1993). Filters were imaged 
in the Phosphoimager (Molecular Dynamics). The results shown in Table 1 indicate 
that HIC- 1 is found to be hypermethylated in a variety of tumors and cell lines from 
various origins including brain, colon, renal, hematopoietic, and prostate cancers and 
tumors. 

10 TABLE 1 

HYPERMETHYLATION OF HlC-1 IN TUMORS AND CELL LINES 



PRIMARY TUMORS 

BRAIN TUMORS 



CULTURED CELL LINES 



15 



20 



Low Grade 
Astrocytomas 

Anaplastic 
Astrocytomas 

Glioblastoma 
Multiforme 

Medulloblastoma 

COLON CANCERS 

Polyps 

Carcinomas 



1 

7 



METH % 

7 100 

4 80 

6 75 

4 80 



j| METH % 



100 



90 



Glials 



2 2 100 



Carcinoma 6 7 85 



25 



LONG CAKCgRS 



Carcinomas 



5 



0 0 



Carcinoma 16 12 75 
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TABLg 1 (COM'T) 



10 



RHNAX CANCgRS 



Early Stage 


8 


4 


50 






T -9 ^ A C ^ A A 

ijaue ouage 


-a 


^ 






21 16 80 














Lymphomas 


3 


1 


33 


Lymphomas 


8 5 60 


CML/Blast 


8 


7 


87 






AML 


13 


10 


80 






ALL 


10 


8 


80 








i 


METH 


% 




4 METH % 


BRSAST CANCBRS 












Cancer 


24 


15 


62 


Cancers 


6 6 100 



PROSTATB CANCKRS 

Cancer 17 



17 100 



Cancer 



5 4 80 



15 gWDOMKTRIALi CANCBR 

Cancer 6 



4 €7 



20 



NKUROBIA3TOMA3 

early/late stage 12 
(amount of 
methylation LOW) 



2 16 



Cancers 4 4 100 



EXAMPLE 5 

INTERACTION OF P53 WITH HIC-1 EXPRESSION 

Consistent with the hypothesis that a suppressor gene exists at I7pl3.3 which may 
interact with p53, the present invention identifies a potential p53 binding site 4 kb 5' 

25 to the TATA box in the HIC-1 gene (FIGURE IB) Therefore, the p53 response of 
the HIC-1 gene was tested by using a colon cancer cell line (SW480) in which the p53 
responsive gene, W AF- 1 , had been shown previously to be induced by expression of 
wild type p53 (El-Deiry, et ai, Ce//, 75:817-825, 1993). This cell line contains one 
17p chromosome, a mutant p53 allele, and a fiiUy methylated HIC-1 CpG island 

30 Furthermore, the cell line SW480 is severely growth arrested by exogenously 
expressing the wild type p53 gene (Baker, S J , etai. Science, 249:912-915, 1990) 
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expressing the wUd type p53 gene (Baker, S.J., etal.. Science. 249:912-915, 1990) 

FIGURE 5 shows an RNAse protertion assay, as detailed in FIGURE 4, after 
infection of an adenoviral vector containing either the p-galactosidase gene or the 
wild type human p53 gene into the SW480 line of human colon cancer cells 
5 (Uninfected, normal, control human fibroblasts (F), uninfected SW480 cells (U), 
SW480 ceUs infeaed with the P-galactosidase gene (GAL), and SW480 cells infected 
with the p53 gene (p53)). Positions of the undigested HIC-1 and GAPDH probes and 
of the HIC-1 and GAPDH transcripts are marked exactly as in FIGURE 4 

HIC-1 is expressed at only low levels in this cells line (Fig 5 A - U). When the wild 
10 type p53 gene is exogenously expressed in the SW480 cells, the level of HIC-1 
expression is upregulated 20 fold (Fig 5 - p53), as compared to control cells (U & 
GAL) These results suggest that the tumor suppressor gene p53 activates HIC-1 
expression, either directly or indirectly. However, since a p53 binding sites has been 
identified 4 Okb upstream from the transcription start site (see enclosed map), it 
1 5 suggests a direct interaction between p53 and HIC- 1 . We are working to validate this 
type of interaction. 

SUMMARY OF EXAMPLES 
HIC-1 plays a significant role in normal and neoplastic cells At least four other genes 
have thus far been identified as potential downstream targets of p53, including WAFl 
20 (El-Deiry, W.S., et al., supra.) MDM2 (Chen, C.Y.. ef at.. Proc Natl. Acad. Sci. USA, 
91:2684-2688, 1994), GADD45 (Kastan, M B , et ai. Cell, 71 587-597, 1992) and 
BAX (Miyashita, T., et ai. Oncogene. 9:1799-1805. 1994) HIC-1 probably 
fijnctions as a transcription factor, as inferred by its structure and the characteristics 
of the other members of the Zin domain family. Two drosophila members, tram-track 

25 and broad complex, are transcriptional repressors which help regulate segmental 
development (Harrison and Travers, EMBOJ9:201, 1990; di Bello, et al. Genetics, 
129:385, 1991). A third drosophila protein, GAGA appears to fiinction by dynami- 
cally blocking the formation of nucleosomal structures which would impede trans- 
criptional activation of promoter regions (Tsukiyama, T., et ai. Nature, 362 525-532, 

30 1994). The murine Zin domain gene. MZF5, has in-vitro transcriptional repressor 
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re- 
activity for c-myc and thymidine kinase promoters (Numoto, e( aL, Nucleic Acids 
Res , 21:3767, 1993) Finally, two of the 4 other human Zin domain proteins were 
found as components of translocations in humem neoplasms (Chardin, et al., Nucleic 
Acids Res., 19:1431, 1991; Hromas, etaL, J. BioL Chem., 266:14183, 1991, Chen, 
5 ei al.,EMBO J., J2:l 161, 1993) Second, it is necessary to determine the precise 
interaction between p53 and the HIC-1 promoter. 

In summary, the present invention identifies a new gene at 17pl3.3, HIC-1, for which 
the expression pattern, structural motifs, chromosomal location, and p53 
responsiveness are suggestive of an important function in tumorgenesis. 

10 Identification of the precise p53 pathway in which HIC-1 is involved should clarify 
the role of this gene in normal and neoplastic cells. Finally, the results suggest that 
in tumor DNA, identification of hypermethylated CpG islands associated with regions 
of allelic loss could facilitate the localization and cloning of candidate tumor suppres- 
sor genes as well as function as markers for recurrent abnormal growth or cells which 

1 5 may be resistant to particular therapeutic regimens 



The foregoing is meant to illustrate, but not to limit, the scope of the invention. 
Indeed, those of ordinary skill in the art can readily envision and produce further 
embodiments, based on the teachings herein, without undue experimentation. 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4616 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 
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(ii) MOLECULE TYPE: DNA (genomic) 

ivii) IMMEDIATE SOURCE: 

(B) CLONE: HIC-1 polynucleotide 

(ix) FEATURE: 
5 (A) NAME/KEY: CDS 

(B) LOCATION: 1 . .4616 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CCCGGCCCGC CGGGACCGCA GGTAACGGGC CGCGGGGCCC CGCGGGCCAG GAGGGGAACG 6 0 

GGGTCGGGCG GGCGAGCAGC GGGCAGGGGA GCTCAGGGCT CGGCTCCGGG CTCTGCCGCC 120 

10 GGATTTGGGG GCCGCGAGGA AGAGCTGCGA GCCGAGGGCC TGGGGCCGGC GCACTCCTCC 180 

CGCCCTGTCT GCAGTTGGAA AACTTTTCCC CAAGTTTGGG GCGGCGGAGT TCCGGGGGAG 24 0 

AAGGGGCCGG GGGAGCCGCG GAGGGAGGCG CCGGGCCCGC GCGTGTAGGG CCCAGGCCGA 3 00 

GGCCGGGACG CGGGTGGGGC GCAGGCCCGG GTCAGGGCCG CAGCCGGCTG TGCGCCGTGC 360 

CCGCCCGGGG CGCTGCCCCC TCCCTCCCCT GGGAGCTGCG TGGCTCCCCC CTCCCCCCCA 4 20 

15 CCTGCTTCCT GCCTCAGCCT CCTGCCCCGA TATAACGCCC TCCCCGCGCC GGGCCCGGCC 4 80 

TTCGCGCTCT GCCCGCCACG GCAGCCGCTG CCTCCGCTCC CCGCGCGGCC GCCGCCCGGG 54 0 

CCCCGACCGA GGGTTGACAG CCCCCGGCCA GGGCGGCGCC AGGGCGGGCA CCGCGCTCCC 600 

CTCCTCCGTA TCACTTCCCC CAACTGGGGC AACTTCTCCC GAGGCGGGAG GCGCTGGTTC 6 60 

CTCGGCTCCC TTTCTCCCTA CTTGGGTAAA GTTCTCCGCC CTGAATGACT TTTCCTGAAG 72 0 

20 CGGACATTTT ACTTAAATCG GGTAACTGTC TCCAAAAGGG TCACTGCGCC TGAACAGTTT 7 80 

TCTTCTCGGA AGCCCCAGCA CCCAGCCAGG TGCCCTGGGG CGTGCAGGCC GCCCTGGCCT 84 0 

CCCCTCCACC GGCGGCCGCT CACCTCCTGC TCCTTCTCCT GGTCCGGGCG GGCCGGCCTG 90 0 

GGCTCCCACT CCAGAGGGCA GCTGGTCCTT CGCCGGTGCC CAGGCCGCAG GGCTGATGCC 96 0 

CCCGCTCAGC TGAGGGAAGG GGAAGTGGAG GGGAGAAGTG CCGGGCTGGG GCCAGGCGGC 102 0 

25 CAGGGCGCCG CACGGCTCTC ACCCGGCCGG TGTGTGTCCC CGCAGGAGAG TGTGCTGGGC 10 80 
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AGACGATGCT GGACACGATG GAGGCGCCCG GCCACTCCAG GCAGCTGCTG CTGCAGCTCA 
ACAACCAGCG CACCAAGGGC TTCTTGTGCG ACGTGATCAT CGTGGTGCAG AACGCCCTCT 
TCCGCGCGCA CAAGAACGTG CTGGCX5GCCA GCAGCGCCTA CCTCAAGTCC CTGGTGGTGC 
ATGACAACCT GCTCAACCTG GACCATGACA TGGTGAGCCC GGCCGTGTTC CGCCTGGTGC 
5 TGGACTTCAT CTACACCGGC CGCCTGGCTG ACGGCGCAGA GGCGGCTGCG GCCGCGGCCG 

TGGCCCCGGG GGCTGAGCCG AGCCTGGGCG CCGTGCTGGC CGCCGCCAGC TACCTGCAGA 
TCCCCGACCT CGTGGCGCTG TGCAAGAAAC GCCTCAAGCG CCACGGCAAG TACTGCCACC 
TGCGGGGCGG CGGCGGCGGC GGCGGCGGCT ACGCGCCCTA TGGTCGGCCG GGCCGGGGCC 
TGCGGGCCGC CACGCCGTCA TCCAGGCCTG CTACCCGTCC CCAGTCGGGC CTCCGCCGCC 
10 GCCTGCCXSCG GAGCCGCCCT CGGGCCCAGA GGCCGCGGTC AACACGCACT GCGCCGAGCT 

GTACGCGTCG GGACCCGGCC CGGCCGCCGC ACTCTGTGCC TCGGAGCGCC GCTGCTCCCC 
TCTTTGTGGC CTGGACCTGT CCAAGAAGAG CCCGCCGGGC TCCGCGGCGC CAGAGCGGCC 
GCTGGCTGAG CGCGAGCTGC CCCCGCGCCC GGACAGCCCT CCCAGCGCCG GCCCCGCCGC 
CTACAAGGAG CCGCCTCTCG CCCTGCCGTC GCTGCCGCCG CTGCCCTTCC AGAAGCTGGA 
15 GGAGGCCGCA CCGCCTTCCG ACCCATTTCG CGGCGGCAGC GGCAGCCCGG GACCCGAGCC 

CCCCGGCCGC CCCAACGGGC CTAGTCTCCT CTATCGCTGG ATGAAGCACG AGCCGGGCCT 
GGGTAGCTAT GGCGACGAGC TGGGCCGGGA GCGCGGCTCC CCCAGCGAGC GCTGCGAAGA 
GCGTGGTGGG GACGCGGCCG TCTCGCCCGG GGGGCCCCCG CTCGGCCTGG CGCCGCCGCC 
GCGCTACCCT GGCAGCCTGG ACGGGCCCGG CGCGGGCGGC GACGGCGACG ACTACAAGAG 
20 CAGCAGCGAG GAGACCGGTA GCAGCGAGGA CCCCAGCACC GCCTGGCGGC CACCTCGAGG 

GCTACCCATG CCCGCACCTG GCCTATGGCG AGCCCGAGAG CTTCGGTGAC AACCTGTACG 
TGTGCATTCC GTGCGGCAAG GGCTTCCCCA GCTCTGAGCA GCTGAACGCG CACGTGGAGG 
CTCACGTGGA GGAGGAGGAA GCGCTGTACG GCAGGGCCGA GGCGGCCGAA GTGGCCGCTG 
GGGCCGCCGG CCTAGGGCCC CCTTTTGGAG GCGGCGGGGA CAAGGTCGCC GGGGCTCCGG 

SUBSTITUTE SHEET (RULE 26) 



1140 



1200 



1260 



1320 



1380 



1440 



1500 



1560 



1620 



16B0 



1740 



1800 



1860 



1920 



1980 



2040 



2100 



2160 



2220 



2280 



2340 



2400 



2460 



2520 
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GTGGCCTGGG AGAGCTGCTG CGGCCCTACC 
ACCCGGCCAC GCTGCGGCAG CACGAGAAGA 
CCATCTGCGG GAAGAAGTTC ACGCAGCGTG 
TGGGCCTCAA GCCCTTCGCG TGCGACGCGT 
5 TCACCCGGAC GCACATGCGC ATCCACCCTC 

GCGGCGGCAA GTTCGCACAG CAACGCAACC 
GGGGCGCGGC GGCGCGGCCG GGGCGCTGGC 
CCCCGACGGC AAGGGCAAGC TCGACTTCCC 
GCCGAGCAGC TGAGCCTGAA GCAGCAGGAC 
10 ACCACGCACT TCCTGCACGA CCCCAAGGTG 

TTCACGGCCG AGCTGGGCCT CAGCCCCGAC 
CACCTGGCGG CCGGGCCCGA CGGCGGACCA 
CTCGCCAGCC CGCTCTGTCG CTGCTGCGCG 
GGCGGCGCGC AGGGCCCACT GTGCCCGGGA 
15 ACCTCTCGGC GGCCTCACCT GGCCTCACTG 

AACCCCGGGA CGGGGTGGGA TGGGGTAAGG 
CAAAGGAGAC CCCAGGCCCC TCCCGCCTCT 
TCCGCGCTGC TCTTAGAGGG GGAGGGGTGT 
GGCCCTTGCG ACCACACCCA TTCTCACTGT 
20 AGAGTTGGGG AGTGGGGAGG GGACTGAGCC 

CCACCCCGGG ACTGATAATG TGAAGTTCCT 
CAACCCTTCC TTCCTCAGTC ACCAAGGGCG 
TACCACCAGG TCTCCCACTC CCGCGGTGCC 
TATTTATTGC ATGCGCCCCG GCGGCCCCCC 
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GCTGCGGCTC GTGCGACAAG AGCTACAAGG 2 580 

CGCACTGGCT GACCCGGCCC TACCCATGCA 26 4 0 

GGACCATGAC GCGCCACATG CGCAGCCACC 2 700 

GCGGCATGCG GTTCACGCGC CAGTACCGCC 2 76 0 

GCGGCGAGAA GCCCTACGAG TGCCAGGTGT 2 82 0 

TCATCAGCCA CATGAAGATG CACGCCGTGG 2 880 

GGGCTTGGGG GGGCTCCCCG GCGTCCCCGG 2 94 0 

CGAGGGCGTC TTTGCTGTGG CTCGCTCACG 3 00 0 

AAGGCGGCCG CGACCGAGCT GCTGGCGCAG 3 06 0 

GCGCTGGAGA GCCTCTACCC GCTGGCCAAG 3120 

AAGGCGGCCG AGGTGCTGAG CCAGGGCGCT 3180 

TCGACCGTTT CTCTCCCACC TAGAGCGCCC 3 24 0 

GCCCTGGCCC GCACCCCAGG GAGCGGCGGG 3 3 00 

CAACCGCAGC GTCGCCACAG TGGCGGCTCC 3 3 60 

CTTCGTGCCT TAGCTCGGGG GTCGGGGGAG 3 420 

GAAATTTATA TTTTTGATAT CAGCTTTGAC 3 4 80 

TCCTGTGGTT CGTCGGCCCC CTCCCCCGGC 3 54 0 

CACTGTCGGG GCACTCCTAG CCCTACCTCC 3 6 00 

GAATCTCCCC GCTGGGTCGG AGCGTCGGGC 36 60 

GGCCGGAGGC CCCCGCACCC CCGCTCCCAC 3 720 

CATTTTGCAC AAGTGGCACT AGCCCAGGGC 3 7 80 

GGGAGTTCTG GAGTCGGAAG GCGAAGAGCC 3 84 0 

CTCCCTTCCC TTCCCTGCGG CCCCGGACCA 3 900 

ATCCCGAGCC CAGGCTGGGC TGGGCTGGAA 3 96 0 
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CGCGGTCTCT TTAGCTCCCT CCTCTTCGTT TGTATATTTC CTACCTTGTA CACAGCTCTT 4 0 20 

CCAGAGCCGC TTCCATTTTC TATACTCGAA CCAAACAGCA ATAAAGCAGT AACCAAGGAC 4 08 0 

CCCGACCCCG CTGCTCTCTT CTGCCCCTGC ACAAGGACCT GGATGCTGCG CCCGCTGGGT 414 0 

GGAGGAGCCA GAAAGGGCCA CCCTCACACA GGTGCAGAGG CTTGGACCTG CCTCCCTCCC 4 2 00 

5 CAGTCCCAGA AACAGATCAG CAAGAGGTCA GGTATGTTTC ATAACTAAAA ATTTATTAAG 4 2 60 

GAAACAAAAC CAGTGCTGCA AACGGGACAG AAAGGAGAGC TGGGTCTCCC TCCCGACCAC 4 3 20 

CCAGTCATCG GCCTTCCAGC TGGGGAGAGA ATCTTAAAGG AGAGGCCGGG GACCCTGTAC 4 3 80 

TCCAAAGAGC CCAGTCTTCT GAGACTCTAG GGGACTCCTA CCCCCAAACT ACTGGCCTTG 4 44 0 

GCTCCCCTAC ACGGTACCCC ATCGCTTCTG GCATAGTCCT GGGCCTCAGG GAGGGCAGAG 4 500 

10 CTGCGCACCC ATCCTCCAGG CAGGCTGTGC AGTCAGGCCA TGGGCTCTGG GGTATCCCCC 4560 
ACTGGTCCCA TTAAGATTTG CCCCTGGCTC CACCGAAAAC CCCGTCTTCC CCTAAG 
(2) INFORMATION FOR SEQ ID NO : 2 : 



4616 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4112 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECXXLE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 
20 (B) CLONE: HIC- 1 coding polynucleotide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1086.. 2726 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
25 CCCGGCCCGC CGGGACCGCA GGTAACGGGC CGCGGGGCCC CGCGGGCCAG GAGGGGAACG 

GGGTCGGGCG GGCGAGCAGC GGGCAGGGGA GCTCAGGGCT CGGCTCCGGG CTCTGCCGCC 12 0 

GGATTTGGGG GCCGCGAGGA AGAGCTGCGA GCCGAGGGCC TGGGGCCGGC GCACTCCTCC 180 

CGCCCTGTCT GCAGTTGGAA AACTTTTCCC CAAGTTTGGG GCGGCGGAGT TCCGGGGGAG 24 0 

SUBSTITUTE SHEET (RULE 26) 
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AAGGGGCCGG GGGAGCCGCG GAGGGAGGCG CCGGGCCCGC GCGTGTAGGG CCCAGGCCGA 3 00 

GGCCGGGACG CGGGTGGGGC GCAGGCCCGG GTCAGGGCCG CAGCCGGCTG TGCGCCGTGC 360 

CCGCCCGGGG CGCTGCCCCC TCCCTCCCCT GGGAGCTGCG TGGCTCCCCC CTCCCCCCCA 42 0 

CCTGCTTCCT GCCTCAGCCT CCTGCCCCGA TATAACGCCC TCCCCGCGCC GGGCCCGGCC 4 80 

5 TTCGCGCTCT GCCCGCCACG GCAGCCGCTG CCTCCGCTCC CCGCGCGGCC GCCGCCCGGG 54 0 

CCCCGACCGA GGGTTGACAG CCCCCGGCCA GGGCGGCGCC AGGGCGGGCA CCGCGCTCCC 6 00 

CTCCTCCGTA TCACTTCCCC CAACTGGGGC AACTTCTCCC GAGGCGGGAG GCGCTGGTTC 660 

CTCGGCTCCC TTTCTCCCTA CTTGGGTAAA GTTCTCCGCC CTGAATGACT TTTCCTGAAG 72 0 

CGGACATTTT ACTTAAATCG GGTAACTGTC TCCAAAAGGG TCACTGCGCC TGAACAGTTT 7 80 

TCTTCTCGGA AGCCCCAGCA CCCAGCCAGG TGCCCTGGGG CGTGCAGGCC GCCCTGGCCT 84 0 

CCCCTCCACC GGCGGCCGCT CACCTCCTGC TCCTTCTCCT GGTCCGGGCG GGCCGGCCTG 90 0 

GGCTCCCACT CCAGAGGGCA GCTGGTCCTT CGCCGGTGCC CAGGCCGCAG GGCTGATGCC 96 0 

CCCGCTCAGC TGAGGGAAGG GGAAGTGGAG GGGAGAAGTG CCGGGCTGGG GCCAGGCGGC 102 0 

CAGGGCGCCG CACGGCTCTC ACCCGGCCGG TGTGTGTCCC CGCAGGAGAG TGTGCTGGGC 10 80 



10 



15 



20 



25 



AGACG ATG CTG GAC ACG ATG GAG GCG CCC GGC CAC TCC AGG GAG CTG 1127 
Met Leu Asp Thr Met Glu Ala Pro Gly His Ser Arg Gin Leu 
1 5 10 

CTG CTG GAG CTC AAC AAC CAG CGC ACC AAG GGC TTC TTG TGC GAC GTG 1175 
Leu Leu Gin Leu Asn Asn Gin Arg Thr Lys Gly Phe Leu Cys Asp Val 
i5 20 25 30 

ATC ATC GTG GTG CAG AAC GCC CTC TTC CGC GCG CAC AAG AAC GTG CTG 122 3 

He He Val Val Gin Asn Ala Leu Phe Arg Ala His Lys Asn Val Leu 
35 40 45 

GCG GCC AGC AGC GCC TAC CTC AAG TCC CTG GTG GTG CAT GAC AAC CTG 12 71 

Ala Ala Ser Ser Ala Tyr Leu Lys Ser Leu Val . Val His Asp Asn Leu 
50 55 60 

CTC AAC CTG GAC CAT GAC ATG GTG AGC CCG GCC GTG TTC CGC CTG GTG 1319 
Leu Asn Leu Asp His Asp Met Val Ser Pro Ala Val Phe Arg Leu Val 
65 70 75 
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CTG GAC TTC ATC TAC ACC GGC CGC CTG GCT GAC GGC GCA GAG GCG GCT 1367 
Leu Asp Phe He Tyr Thr Gly Arg Leu Ala Asp Gly Ala Glu Ala Ala 
80 85 90 

GCG GCC GCG GCC GTG GCC CCG GGG GCT GAG CCG AGC CTG GGC GCC GTG 1415 
5 Ala Ala Ala Ala Val Ala Pro Gly Ala Glu Pro Ser Leu Gly Ala Val 

95 100 105 110 

CTG GCC GCC GCC AGC TAC CTG CAG ATC CCC GAC CTC GTG GCG CTG TGC 14 6 3 

Leu Ala Ala Ala Ser Tyr Leu Gin He Pro Asp Leu Val Ala Leu Cys 
115 120 125 

10 AAG AAA CGC CTC AAG CGC CAC GGC AAG TAC TGC CAC CTG CGG GGC GGC 1511 

Lys Lys Arg Leu Lys Arg His Gly Lys Tyr Cys His Leu Arg Gly Gly 
130 135 140 

GGC GGC GGC GGC GGC GGC TAC GCG CCC TAT GCT ATG GCG ACG AGC TGG 155 9 

Gly Gly Gly Gly Gly Gly Tyr Ala Pro Tyr Ala Met Ala Thr Ser Trp 
15 145 150 155 

GCC GGG AGC GCG GCT CCC CCA GCG AGC GCT GCG AAG AGC GTG GTG GGG 16 0 7 

Ala Gly Ser Ala Ala Pro Pro Ala Ser Ala Ala Lys Ser Val Val Gly 
160 165 170 

ACG CGG CCG TCT CGC CCG GGG GGC CCC CGC TCG GCC TGG CGC CGC CGC 16 5 5 

20 Thr Arg Pro Ser Arg Pro Gly Gly Pro Arg Ser Ala Trp Arg Arg Arg 

175 180 185 190 

CGC GCT ACC CTG GCA GCC TGG ACG GGC CCG GCG CGG GCG GCG ACG GCG 17 03 

Arg Ala Thr Leu Ala Ala Trp Thr Gly Pro Ala Arg Ala Ala Thr Ala 
195 200 205 

25 ACG ACT ACA AGA GCA GCA GCG AGG AGA CCG GTA GCA GCG AGG ACC CCA 1751 

Thr Thr Thr Arg Ala Ala Ala Arg Arg Pro Val Ala Ala Arg Thr Pro 
210 215 220 

GCA CCG CCT GGC GGC CAC CTC GAG GGC TAC CCA TGC CCG CAC CTG GCC 17 99 

Ala Pro Pro Gly Gly His Leu Glu Gly Tyr Pro Cys Pro His Leu Ala 
30 225 230 235 

TAT GGC GAG CCC GAG AGC TTC GGT GAC AAC CTG TAC GTG TGC ATT CCG 184 7 

Tyr Gly Glu Pro Glu Ser Phe Gly Asp Asn Leu Tyr Val Cys He Pro 
240 245 250 

TGC GGC AAG GGC TTC CCC AGC TCT GAG CAG CTG AAC GCG CAC GTG GAG 18 95 

35 Cys Gly Lys Gly Phe Pro Ser Ser Glu Gin Leu Asn Ala His Val Glu 

255 260 265 270 
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GCT CAC GTG GAG GAG GAG GAA GCG CTG TAG GGC AGG GCC GAG GCG GCC 194 3 

Ala His Val Glu Glu Glu Glu Ala Leu Tyr Gly Arg Ala Glu Ala Ala 
275 280 285 

GAA GTG GCC GCT GGG GCC GCC GGC CTA GGG CCC CCT TTT GGA GGC GGC 1991 
Glu Val Ala Ala Gly Ala Ala Gly Leu Gly Pro Pro Phe Gly Gly Gly 
290 295 300 

GGG GAC AAG GTC GCC GGG GCT CCG GGT GGC CTG GGA GAG CTG CTG CGG 2 03 9 

Gly Asp Lys Val Ala Gly Ala Pro Gly Gly Leu Gly Glu Leu Leu Arg 
305 310 315 

CCC TAC CGC TGC GGC TCG TGC GAC AAG AGC TAC AAG GAC CCG GCC ACG 2087 
Pro Tyr Arg Cys Gly Ser Cys Asp Lys Ser Tyr Lys Asp Pro Ala Thr 
320 325 330 

CTG CGG CAG CAC GAG AAG ACG CAC TGG CTG ACC CGG CCC TAC CCA TGC 213 5 

Leu Arg Gin His Glu Lys Thr His Trp Leu Thr Arg Pro Tyr Pro Cys 
335 340 345 350 

ACC ATC TGC GGG AAG AAG TTC ACG CAG CGT GGG ACC ATG ACG CGC CAC 2183 
Thr He Cys Gly Lys Lys Phe Thr Gin Arg Gly Thr Met Thr Arg His 
355 360 365 

ATG CGC AGC CAC CTG GGC CTC AAG CCC TTC GCG TGC GAC GCG TGC GGC 22 31 

Met Arg Ser His Leu Gly Leu Lys Pro Phe Ala Cys Asp Ala Cys Gly 
370 375 380 

ATG CGG TTC ACG CGC CAG TAC CGC CTC ACC CGG ACG CAC ATG CGC ATC 22 79 

Met Arg Phe Thr Arg Gin Tyr Arg Leu Thr Arg Thr His Met Arg He 
385 390 395 

CAC CCT CGC GGC GAG AAG CCC TAC GAG TGC CAG GTG TGC GGC GGC AAG 23 27 

His Pro Arg Gly Glu Lys Pro Tyr Glu Cys Gin Val Cys Gly Gly Lys 
400 405 410 

TTC GCA CAG CAA CGC AAC CTC ATC AGC CAC ATG AAG ATG CAC GCC GTG 23 75 

Phe Ala Gin Gin Arg Asn Leu He Ser His Met Lys Met His Ala Val 
415 420 425 430 

GGG GGC GCG GCG GCG CGG CCG GGG CGC TGG CGG GCT TGG GGG GGC TCC 2423 
Gly Gly Ala Ala Ala Arg Pro Gly Arg Trp Arg Ala Trp Gly Gly Ser 
435 440 445 

CCG GCG TCC CCG GCC CCG ACG GCA AGG GCA AGC TCG ACT TCC CCG AGG 2471 
Pro Ala Ser Pro Ala Pro Thr Ala Arg Ala Ser Ser Thr Ser Pro Arg 
450 455 460 
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GCG TCT TTG CTG TGG CTC GCT CAC GGC CGA GCA GCT GAG CCT GAA GCA 2519 
Ala Ser Leu Leu Trp Leu Ala His Gly Arg Ala Ala Glu Pro Glu Ala 
465 470 475 

GCA GGA CAA GGC GGC CGC GAC CGA GCT GCT GGC GCA GAC CAC GCA CTT 
5 Ala Gly Gin Gly Gly Arg Asp Arg Ala Ala Gly Ala Asp His Ala Leu 

480 485 490 

CCT GCA CGA CCC CAA GGT GGC GCT GGA GAG CCT CTA CCC GCT GGC CAA 
Pro Ala Arg Pro Gin Gly Gly Ala Gly Glu Pro Leu Pro Ala Gly Gin 
495 500 505 510 

10 GTT CAC GGC CGA GCT GGG CCT CAG CCC CGA CAA GGC GGC CGA GGT GCT 

val His Gly Arg Ala Gly Pro Gin Pro Arg Gin Gly Gly Arg Gly Ala 
515 520 525 

GAG CCA GGG CGC TCA CCT GGC GGC CGG GCC CGA CGG CGG ACC ATC GAC 
Glu Pro Gly Arg Ser Pro Gly Gly Arg Ala Arg Arg Arg Thr He Asp 
15 530 535 540 

CGT TTC TCT CCC ACC TAGAGCGCCC CTCGCCAGCC CGCTCTGTCG CTGCTGCGCG 
Arg Phe Ser Pro Thr 
545 

GCCCTGGCCC GCACCCCAGG GAGCGGCGGG GGCGGCGCGC AGGGCCCACT GTGCCCGGGA 2826 

20 CAACCGCAGC GTCGCCACAG TGGCGGCTCC ACCTCTCGGC GGCCTCACCT GGCCTCACTG 2 8 86 

CTTCGTGCCT TAGCTCGGGG GTCGGGGGAG AACCCCGGGA CGGGGTGGGA TGGGGTAAGG 294 6 

GAAATTTATA TTTTTGATAT CAGCTTTGAC CAAAGGAGAC CCCAGGCCCC TCCCGCCTCT 3006 

TCCTGTGGTT CGTCGGCCCC CTCCCCCGGC TCCGCGCTGC TCTTAGAGGG GGAGGGGTGT 3 066 

CACTGTCGGG GCACTCCTAG CCCTACCTCC GGCCCTTGCG ACCACACCCA TTCTCACTGT 312 6 

25 GAATCTCCCC GCTGGGTCGG AGCGTCGGGC AGAGTTGGGG AGTGGGGAGG GGACTGAGCC 318 6 

GGCCGGAGGC CCCCGCACCC CCGCTCCCAC CCACCCCGGG ACTGATAATG TGAAGTTCCT 324 6 

CATTTTGCAC AAGTGGCACT AGCCCAGGGC CAACCCTTCC TTCCTCAGTC ACCAAGGGCG 3306 

GGGAGTTCTG GAGTCGGAAG GCGAAGAGCC TACCACCAGG TCTCCCACTC CCGCGGTGCC 3366 

CTCCCTTCCC TTCCCTGCGG CCCCGGACCA TATTTATTGC ATGCGCCCCG GCGGCCCCCC 3426 

30 ATCCCGAGCC CAGGCTGGGC TGGGCTGGAA CGCGGTCTCT TTAGCTCCCT CCTCTTCGTT 34 8 6 
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TGTATATTTC CTACCTTGTA CACAGCTCTT CCAGAGCCGC TTCCATTTTC TATACTCGAA 3 54 6 

CCAAACAGCA ATAAAGCAGT AACCAAGGAC CCCGACCCCG CTGCTCTCTT CTGCCCCTGC 3 606 

ACAAGGACCT GGATGCTGCG CCCGCTGGGT GGAGGAGCCA GAAAGGGCCA CCCTCACACA 3 66 6 

GGTGCAGAGG CTTGGACCTG CCTCCCTCCC CAGTCCCAGA AACAGATCAG CAAGAGGTCA 3 72 6 

GGTATGTTTC ATAACTAAAA ATTTATTAAG GAAACAAAAC CAGTGCTGCA AACGGGACAG 3 786 

AAAGGAGAGC TGGGTCTCCC TCCCGACCAC CCAGTCATCG GCCTTCCAGC TGGGGAGAGA 3 84 6 

ATCTTAAAGG AGAGGCCGGG GACCCTGTAC TCCAAAGAGC CCAGTCTTCT GAGACTCTAG 3 90 6 

GGGACTCCTA CCCCCAAACT ACTGGCCTTG GCTCCCCTAC ACGGTACCCC ATCGCTTCTG 3 966 

GCATAGTCCT GGGCCTCAGG GAGGGCAGAG CTGCGCACCC ATCCTCCAGG CAGGCTGTGC 4 026 

AGTCAGGCCA TGGGCTCTGG GGTATCCCCC ACTGGTCCCA TTAAGATTTG CCCCTGGCTC 4086 

CACCGAAAAC CCCGTCTTCC CCTAAG ^ , , 

4112 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 547 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met Leu Asp Thr Met Glu Ala Pro Gly His Ser Arg Gin Leu Leu Leu 

15 10 15 

Gin Leu Asn Asn Gin Arg Thr Lys Gly Phe Leu Cys Asp Val He He 
20 25 30 

Val Val Gin Asn Ala Leu Phe Arg Ala His Lys Asn Val Leu Ala Ala 
35 40 45 

Ser Ser Ala Tyr Leu Lys Ser Leu Val Val His Asp Asn Leu Leu Asn 
50 55 60 

Leu Asp His Asp Met Val Ser Pro Ala Val Phe Arg Leu Val Leu Asp 
65 70 75 80 
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Phe lie Tyr Thr Gly Arg Leu Ala Asp Gly Ala Glu Ala Ala Ala Ala 
85 90 95 

Ala Ala Val Ala Pro Gly Ala Glu Pro Set Leu Gly Ala Val Leu Ala 
100 105 110 

5 Ala Ala Ser Tyr Leu Gin lie Pro Asp Leu Val Ala Leu Cys Lys Lys 

115 120 125 

Arg Leu Lys Arg His Gly Lys Tyr Cys His Leu Arg Gly Gly Gly Gly 
130 135 140 

Gly Gly Gly Gly Tyr Ala Pro Tyr Ala Met Ala Thr Ser Trp Ala Gly 
10 145 150 155 160 

Ser Ala Ala Pro Pro Ala Ser Ala Ala Lys Ser Val Val Gly Thr Arg 
165 170 175 

Pro Ser Arg Pro Gly Gly Pro Arg Ser Ala Trp Arg Arg Arg Arg Ala 
180 185 190 

15 Thr Leu Ala Ala Trp Thr Gly Pro Ala Arg Ala Ala Thr Ala Thr Thr 

195 2O0 205 

Thr Arg Ala Ala Ala Arg Arg Pro Val Ala Ala Arg Thr Pro Ala Pro 
210 215 220 

Pro Gly Gly His Leu Glu Gly Tyr Pro Cys Pro His Leu Ala Tyr Gly 
20 225 230 235 240 

Glu Pro Glu Ser Phe Gly Asp Asn Leu Tyr Val Cys He Pro Cys Gly 
245 250 255 

Lys Gly Phe Pro Ser Ser Glu Gin Leu Asn Ala His Val Glu Ala His 
260 265 270 

25 Val Glu Glu Glu Glu Ala Leu Tyr Gly Arg Ala Glu Ala Ala Glu Val 

275 280 285 

Ala Ala Gly Ala Ala Gly Leu Gly Pro Pro Phe Gly Gly Gly Gly Asp 
290 295 300 

Lys Val Ala Gly Ala Pro Gly Gly Leu Gly Glu Leu Leu Arg Pro Tyr 
30 305 310 315 320 

Arg Cys Gly Ser Cys Asp Lys Ser Tyr Lys Asp Pro Ala Thr Leu Arg 
325 330 335 
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Gin His Glu Lys 
340 

Cys Gly Lys Lys 
355 

5 Ser His Leu Gly 

370 

Phe Thr Arg Gin 
385 

Arg Gly Glu Lys 

10 

Gin Gin Arg Asn 
420 

Ala Ala Ala Arg 
435 

15 Ser Pro Ala Pro 

450 



Thr His Trp Leu Thr Arg 
345 

Phe Thr Gin Arg Gly Thr 
360 

Leu Lys Pro Phe Ala Cys 
375 

Tyr Arg Leu Thr Arg Thr 
390 

Pro Tyr Glu Cys Gin Val 
405 410 

Leu lie Ser His Met Lys 
425 

Pro Gly Arg Trp Arg Ala 
440 

Thr Ala Arg Ala Ser Ser 
455 



Pro Tyr Pro Cys Thr lie 
350 

Met Thr Arg His Met Arg 
365 

Asp Ala Cys Gly Met Arg 
380 

His Met Arg lie His Pro 
395 400 

Cys Gly Gly Lys Phe Ala 
415 

Met His Ala Val Gly Gly 
430 

Trp Gly Gly Ser Pro Ala 
445 

Thr Ser Pro Arg Ala Ser 
460 



Leu Leu Trp Leu 
465 

Gin Gly Gly Arg 

20 

Arg Pro Gin Gly 
500 

Gly Arg Ala Gly 
515 

25 Gly Arg Ser Pro 

530 



Ala His Gly Arg 
470 

Asp Arg Ala Ala 
485 

Gly Ala Gly Glu 



Pro Gin Pro Arg 
520 

Gly Gly Arg Ala 
535 



Ala Ala Glu Pro 
475 

Gly Ala Asp His 
490 

Pro Leu Pro Ala 
505 

Gin Gly Gly Arg 



Arg Arg Arg Thr 
540 



Glu Ala Ala Gly 
480 

Ala Leu Pro Ala 
495 

Gly Gin Val His 
510 

Gly Ala Glu Pro 
525 

lie Asp Arg Phe 



Ser Pro Thr 
545 
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CLAIMS 

1. A substantially pure HIC-1 (hypermethylated in cancer) polypeptide 
consisting essentially of the amino acid sequence of SEQ ID NO:3. 

2 An isolated polynucleotide sequence consisting essentially of a polynucleotide 
sequence encoding a polypeptide having an amino acid sequence of SEQ ID 
NO 3. 

3 The isolated polynucleotide sequence of claim 2, consisting essentially of a 
polynucleotide sequence encoding a polypeptide having an amino acid 
sequence of SEQ ID NO 3 and having at least one epitope for an antibody 
immunoreactive with HIC- 1 polypeptide. 

4 The polynucleotide of claim 2, wherein the nucleotide sequence is selected 
from the group consisting of 

a) SEQ ID NO 1 , wherein T can also be U, 

b) nucleic acid sequences complementary to a), 

c) fragments of a) or b) that are at least 1 5 bases in length and 
which will selectively hybridize to genomic DNA which 
encodes HIC-1. 

5 A recombinant expression vector which contains the polynucleotide of claim 

2. 

6 A host cell which contains the expression vector of claim 5 

7. An antibody which binds to the polypeptide of SEQ ID N0.3 and which binds 
with immunoreactive fragments of SEQ ID NO 3 



8 



The antibody of claim 7, wherein the antibody is polyclonal. 
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9. The antibody of claim 7. wherein the antibody is monoclonal. 

10 A method for detecting a cell proliferative disorder associated with HIC-1 in 
a subject, comprising contacting a target cellular component containing HIC-1 
with a reagent which reacts with HIC-1 and detecting HIC-1. 

11. The method of claim 10, wherein the target cellular component is nucleic acid. 

12 The method of claim 1 1, wherein the nucleic acid is DNA. 

13 The method of claim 1 1, wherein the nucleic acid is RNA. 

14 The method of claim 1 1, wherein the nucleic acid is hypermethylated. 

15 The method of claim 1 0, wherein the target cellular component is protein 

16 The method of claim 10, wherein the reagent is a probe 

1 7 The method of claim 16, wherein the probe is nucleic acid 

18 The method of claim 16, wherein the probe is an antibody 

19 The method of claim 18, wherein the antibody is polyclonal 

20 The method of claim 18, wherein the antibody is monoclonal. 

21 . The method of claim 16, wherein the probe is detectably labeled. 

22 The method of claim 21, wherein the label is selected from the group 
consisting of a radioisotope, a bioluminescent compound, a chemiluminescent 
compound, a fluorescent compound, a metal chelate, or an enzyme. 
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23. The method of claim 10, wherein the reagent is a restriction endonuclease. 

24. The method of claim 23, wherein the restriction endonuclease is methylation 
sensitive. 

25. The method of claim 24, wherein the restriction endonuclease is selected from 
the group consisting of Mspl, Hpall, BssHll and Notl. 

26. The method of claim 10, wherein the cell proliferative disorder is associated 
with a tissue selected from the group consisting of brain, colon, urogenital, 
lung, renal, hematopoietic, breast, thymus, testis, ovarian, and uterine 

27 The method of claim 26, wherein the disorder is selected from the group 
consisting of low grade astrocytoma, anaplastic astrocytoma, glioblastoma, 
meduUoblastoma, colon cancer, lung cancer, renal cancer, leukemia, breast 
cancer, prostate cancer, endometrial cancer and neuroblastoma. 

28 A method of treating a cell proliferative disorder associated with HIC-1, 
comprising administering to a subject with the disorder, a therapeutically 
effective amount of reagent which modulates HIC-1 expression 

29. The method of claim 28, wherein the reagent is a polynucleotide sequence 
comprising a HIC-1 sense polynucleotide sequence. 

30. The method of claim 29, wherein the reagent further includes is a 
polynucleotide sequence which encodes a promoter in operable linkage to the 
HIC- 1 polynucleotide sequence. 



31 



The method of claim 29, wherein the polynucleotide sequence is in an 
expression vector. 
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32 The method of claim 28, wherein the disorder is associated with a tissue 
selected from the group consisting of brain, urogenital, lung, colon, renal, 
hematopoietic, breast, thymus, testis, ovarian, and uterine. 

33 The method of claim 32, wherein the disorder is selected from the group 
consisting of low grade astrocytoma, anaplastic astrocytoma, glioblastoma, 
meduUoblastoma, colon cancer, lung cancer, renal cancer, leukemia, breast 
cancer, prostate cancer, endometrial cancer and neuroblastoma. 

34 The method of claim 28, wherein the HIC-1 associated cellular proliferative 
disorder is associated with hypermethylation of HIC-1 nucleotide sequence 

35 A method of gene therapy comprising introducing into cells of a host subject, 
an expression vector comprising a nucleotide sequence encoding HIC-1, in 
operable linkage with a promoter. 

36 The method of claim 35, wherein the expression vector is introduced into the 
subject's cells ex vivo and the cells are then reintroduced into the subject 

37 The method of claim 35, wherein the expression vector is an RNA virus 

38 The method of claim 37, wherein the RNA virus is a retrovirus. 

39. The method of claim 35, wherein the subject is a human. 

40. The method of claim 35, wherein the disorder is associated with hyper- 
methylation of HIC-1 polynucleotide. 
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41. A diagnostic kit useful for the detection of a target cellular component 
indicative of a cell proliferative disorder associated with methylation of HIC-1 
nucleic acid comprising carrier means being compartmentalized to receive in 
close confinement therein one or more containers comprising a first container 

5 containing a probe for detection of methylated HIC-1 nucleic acid, 

42. The kit of claim 41, wherein the target cellular component is a HIC-1 
polypeptide. 

43. The kit of claim 42, wherein the probe is an antibody. 

44 The kit of claim 41, wherein the target cellular component is a nucleic acid 
sequence. 

45 . The kit of claim 44, wherein the probe is a polynucleotide hybridization probe 

46 A method for identifying a tumor suppressor gene comprising detecting 
abnormal nucleic acid methylation in a nucleic acid sample and identifying the 
gene. 

47 The method of claim 46, wherein the nucleic acid comprises at least one CpG 
island nucleotide sequence 



48 



The method of claim 47, wherein the CpG nucleotide sequence is hyper 
methylated. 
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CCC GOC ceo CCa OOA CCa CAO OTA ACQ CXIC COC GGO OCC CCa COO CICC AGO AGO 
OOA AGO GOO TCa OOC OOQ CQA GCA oca OOC AGO OOA OCT CAO OOC TCa OCT ceo 
GGC TCT GCC GCC GGA TIT GOG GGC CGC GAG GAA GAG CrO CGA GCX: GAG GGC era 
GGG ceo OCG CAC TCC TCC COC CCT GTC TGC AOTTGG AAA ACTTIT CCC CAA OTT 
TOG GGC GGC GGA GIT CCG GOG GAG AAG GGG CCG GGG GAG CCG CGG AGG GAG GCG 
CCG GGC CCG CGC GTG TAG GGC CCA GGC CGA GGC CGG GAG GCG GOT GGG GCG CAG 
GCC CGG GTC AGO GCC OCA GCC GGC TOT GCG CCG TOC CCa CCC GGG GCG era CCC 
CCT CCC TCC CCT GGG AGC TGC Crro OCT CCC CCC TCC CCC CCA CCr GCT TCC TGC 
CTC AGC CrC era CCC CGA tat AAC GCC CrC CCC GCG CCG OGC CCG GCC TTC GCG 
CTC TOC CCG CCA COG CAO CCG era CCT CCG CrC CCC GCG COG CCG CCG CCC GOO 
EXON X-S*UTR 

CCC CGA CCG AGG GIT OAC AGC CCC CGG CCA GGG CCG CGC CAG GGC GGG CAC CGC 
GCT CCC CTC CTC COT ATC ACT TCC CCC AAC TOO GGC AAC TTC TCC CGA GGC GGG 
AGG CGC TOG TTC CTC OGC TCC CTT TCT CCC TAC TTO GCT AAA GTT CTC CGC CCT 
GAA TGA CTT TTC CTO AAG CGG ACA TIT TAC TTA AAT COG CTA ACT OTC TCC AAA 
AGG GTC ACT GCG CCT OAA CAG TIT TCT TCT CGG AAG CCC CAG CAC CCA GCC AGG 
TOC CCT GGG OCG TOC AGO CCG CCC TOG CCT CCC CTC CAC CGG CGG CCG CTC ACC 

DmtON 

TCC TOC TCC TTC TCC TOO TCC GGG COG GCC GGC CrO GGC TCC CAC TCC AGA GGG 
CAG CTO GTC CTT CGC COG TOC CCA GGC CGC AGO GCT GAT GCC CCC OCT CAG CTO 
AGG GAA GGG GAA GIG GAG GGG AGA AOT GCC GGG CTO GOG CCA GGC GGC CAG GGC 
GCC GCA COG CTC TCA CCC OOC COG TOT OTO TCC CCG CAO GAG AOT Oro CTO OOC 
AGA CGA TGC TOO ACA CGA TOG AGG CGC CCG GCC ACT CCA GGC AGC TOC TOC TOC 



EXONS MalLMAjp1trMatGtaAlaProai|rHis8«r ArtOlnLMiLtBLMafa 

AGC TCA ACA ACC AGC GCA CCA AGG GCT TCT TOT GCG ACO TGA TCA TCO TOG TOC 



LtoABAM(9aAft'nrL.y*GI]rPbBUoCjrs A^i Vain* D* Vd VilOb 

AGA ACO CCC TCT TCC oca CGC ACA AGA ACO TOC TOG COO CCA OCA OCG CCT ACC 



Am Ala Lm Pha Ars Al» Hi* LyB Ajb Val Lm Ala Ala S«r 8«r Ala IVr Lm 

TCA AOT CCC TOG TOG TGC ATO ACA ACC TOC TCA ACC TOO ACC ATO ACA TOG TGA 



Ly 8«r Lm Val Val Hu Asp An Lra Lmi Am Lra Aap His Asp Met Val 8«r 

GCC COG CCG TOT TCC GCC TOO TOC TOO ACT TCA TCT ACA CCO GCC GCC TOG CTO 



Pre Ala Vil Phs ArB Uu Val L«a Aap Phs nt lyr Tfar Gly Arg Ua Ala Asp 

ACO GCG CAG AGO CGG era COG CCG COG CCG TOG CCC COG GGG CTO AGC CGA GCC 



Ala Gla Ala Ala Ala Ala Ala Ala Val Ala Pra Gljr Ala Ghi Pro Sv Ua 

SUBSTITUTE SHEET (RULE 26) 



FIG. 1C-1 
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TCW GCO ceo TOC TOO ceo CCG CCA OCT ACC TGC AGA TCC CCQ 



Gly AJft Vd Lra Ala Ala Ala Scr ly Lea CSa n« Pro Aip Uy VaJ Ala Utt 

TOT OCA AOA AAC OCC TCA AOC OCC ACQ OCA ACn^ ACT GCC ACC TOC OOQ OCO GCO 
Cyt Lyi Ly* Ari Lm Lyi Art HiJ Cajr Lyt Cyt Hit Ari Giy Giy 



GCG oca GCO GCG GCO OCT ACQ CGC CCT ATO GTC GGC COG OCC GOG GCC TGC GOO 
Oy Gty Oly Gly Cfly Tyr Ala Pro T>r 



COG CCA CGC COT CAT CCA GGC CIG CTA CCC GTC CCC AOr COG GCC TCC GCC OCC 
OCCTOC COC OGA GCC GCC Crc GGG CCC AGA GGC CGC OCTCAA CAC OCA CIG CGC 
COA CCT OTA CGC ore GOG ACC COG CCC GGC CGC CGC ACT era TOC Crc OGA GCG 

CCG cro ere ecc TCTTiG TOG CCT OGA CCT crc CAA GAA GAG ccc GCC GOG ere 

CGC GGC GCC AGA GCG GCC OCT GGC TQA GCO COA OCT GCC CCC GCG CCC OGA CAO 
Orc TCC CAO CGC COG CCC CGC CGC CTA CAA Ga\ GCC GCC TCT CGC CCT GCC 
GCT OCC OCC OCT OCC err CCA GAA OCT OGA OGA GGC CGC ACC GCC TTC COA CCC 
ATT TCO COG COG CAO COG CAO CCC GOG ACC COA GCC CCC COG CCG CCC CAA COG 
OCC TAG TCT CCT CTATCG era GAT GAA OCA COA GCC GOG CCT GOO TAG CTA TOG 



XXONS AlaMitAla 

COA COA OCT GOG CCG OGA GCG COO CTC CCC CAO COA GCO CTO COA AGA GCG TGO 



Hr 8cr Trp Ala cay Scr Ala Ata Pro Pro Ala Scr Ala Ala Lya 8«r Val Val 



TOG OGA CGC GGC COT CTC GCC COG GOG GCC CCC GCT COG CCT GGC OCC OCC OCC 



Gly Tfer Art Pre 8«r Arg Pro Gly Giy Pro Ari Scr Ala Trp Atb Atb Ars Arg 



GOO CTA CCC TOG CAO CCT OGA COG GCC COG CGC GOG COG COA COG COA COA CTA 



Ala Tir Lnt Ala Ala Trp Tir Oly Pro Ala Arg Ala Ala Tlr Ala Ilr Ilr IV 



CAA GAG CAO CAO COA OGA GAC COG TAG CAO CGA OGA CCC CAG CAC CGC CTO GCO 



AffgAlaAlaAlaArgArgPro Vol AlaAlaArgTlrPro AlaProProGlyGly 



OCC ACC TCO AGO GCT ACC CAT GCC CGC ACC TGO CCT ATO GCG AGC CCG AGA GCT 



Hit Lttt Glu Gly IVrPra Cyt Pre Hit Leu AlaTyr Oly Glu Pre Gla Scr Pht 



TOO CTO ACA ACC TCT ACQ TCT GCA TTC CCT GCG OCA AGG GCT TCC CCA GCT CTC 



FIG. 1C-2 
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aif Aip Am Leo TVr Vtl Cyi lit Pre Cjn Gly Lfi Gly ^ Scr 8cr Oht 

AOC AGC TGA ACQ CGC ACQ TOO AGO CTC ACXJ TOO AGO AGO AGG AAG CGC TOT ACQ 
GlDUuAmAlaHii V«l Gla Ala Hb Vil da Oa Ghi Glu AlaLcuTVrGly 

GCA GGG CCG AGG CGG CCG AAG TGG CCC CTG GGG CCG CCO GCC TAG GGC CCC CTT 
Arg Ala Glu Ala Ala Glu Val Ala Ala Gly Ala AU Gly Lm Gly Pre Pre Vh$ 

TTO GAG GCO GCG GGG ACA AGGTCG CCG GGG CTC CGG GTG OCC TGG GAG AGC TOC 
Gly Gly Gly Gly Asp Lyi Val Ala Gly Ala Pre Oy Gly Lra Gly Glu Lau Leu 

TOC GGC CCT ACC GCTGCG GCrCCr GCO ACA AQA GCT ACA AGG ACC CGG CCA CGC 
Arg Pro Tyr Arg Cyi Gly Set Cyt A«p Lyi S^r Tyr Lyt Aip Pre AlaTfar Lra 

TOC GGC AGC ACG AOA AGA CGC ACT GGC TGA CCC GGC CCT ACC CAT GCA CCATCT 
Ari Ola Hit Glo Ly« Tht Hit Tip Lra Tfar Arg Pre Tyr Pre Cys Hr Da Cyt 

GCG GOA AGA ACT TCA CGC AGC OTG GOA CCA TGA CGC GCC ACA TOC GCA GCC ACC 
Gly Lyt Lyt Piw IV Gla Arg Gly Ite Mtt Itr Arg Hit Mat Ari Scr Hit Lra 

TOG GCC TCA AGC CCT TCG CCT GCG ACG CCT GCG GCA TGC GOT TCA CGC GCC AOT 
Gly Lra Lyt Pre Pbt Ala Cyt Atp Ala Cyt <ay Mat Arg Pha Tlr Arg Glfl Tyr 

ACC GCC TCA CCC GGA CGC ACA TGC GCA TCC ACC CTC GCG GCG AGA AGC CCT ACG 
Arg Lra Tfar Arg Hr Hit Met Arg lit Hit Pre Arg Gly Glu Lyt Pro T)t Glu 

AOT GCC AGG TOT GCG GCG GCA AOT TCG CAC AGC AAC GCA ACC TCA TCA GCC ACA 
Cyt Gfai Val Cyt Gly Gly Lyt Pht Ala Gla Ola Arg Aaa Lra lit Scr Hit Met 

TOA AGA TGC ACG CCG TGG GOG GCO COG COG CGC GGC COG GGC GCT GGC GOG CFT 
LytMctHit AlaValGlyOlyAlaAlaAlaArgPreGlyArgTrpArgAlaTrp FIG. 1 C-3 
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OOO CKW OCT CCC COO cor CCC CXW CCC COA COO CAA CKW CAA 
Gly cay Pm AU Pro AU Fro Ttr AI a Ari AU S«r Tlr 8«r Pro 

COA OOO CGT CIT TGC TOT GOC TCO Crc ACQ OCC OAO CAO era AGC era AAG GAG 
Afg AU 8«r Utt Lra Trp Lm Alo Hio Oy Ari AU Alft Oo Pro Glu Aim Ala 

CAOGAC AAO GCG OCC OCO ACC OAO CTQ CTO OCO GAG ACC ACO CAC TTC CTG CAC 
Gly Ola Gly Gly Arg Ajp Arg Al» Ala Oly Ala Aip His Ala Ltu Pro Ala Arg 

OAC CCC AAG CTG GCG CTO GAG AOC CTCTAC CCO CTO OCC AAG TTC ACO OCC OAO 
Pro Gin Gly Gly Ala Gly Gltt Pro Uo Pro Ala cay Gte Vol Hu Gly Ars Ala 

CTO GOC crc AGC CCC OAC AAO OCO OCC GAO OTO CTO AOC CAG GOC OCT CAC CTO 
Gly Pro Ola Pro Ar» Gta Oly Oly Afi CHy Ala Ghi Pro Gly Arg Str Pro Gly 

OCO OCC GOG CCC OAC GOC GOA CCA TCO ACC OFT TCT CTC CCA CCT AOA GCG CCC 
Gly Ar» Ala Arg Arg Arg Tlr Da A«p Arg Pba 8«r Pro -ftr 

crc OCC AGC CCG crc TOT COC TGC TOC OCO OCC CTO OCC COC ACC CCA GOG AOC 
GOCGGOGGCOGCGCGCAOGOCCCACTaTOCCCOOaACAACCGCAGCCTCOC CAC 
AOT OGC GOC TCC ACC TCT COG COO CCT CAC Cro OCC TCA CTG err CCT OCC Tr A 
OCT COO GOO TCO GOO OAO AAC CCC OOO ACO OOO TOG OAT OOO OTA AGO GAA ATr 
TAT ATr TTT GAT ATC AOC Trr OAC CAA AGO AGA CCC CAO GCC CCT CC C OCC TCT 
TCC TCT GOT TCO TCO OCC CCC TCC CCC GOC TCC GCG era erc TTA GAO GOG GAG 
GOO TCT CAC TOT COO GOC ACT CCT AOC CCT ACC TCC OGC CCT TOC OAC CAC ACC 
CAT TCT CAC TOT GAA TCT CCC COC TOO crc OGA GCG TCO GOC AGA OTT GOG GAO 
TtX3 OGA GOO OAC TGA GCC GOC COO AGO CCC CCG CAC CCC COC TCC CAC CCA CCC 
COO GAC TOA TAA TOT GAA OTT CCT CAT TTT OCA CAA OTO OCA CTA OCC CAO GOC 
CAA CCC TTC err CCT CAO TCA CCA AGO OCO GOG ACTTCT GOA GTC OGA AGO COA 
AGA GCC TAC CAC CAO crc TCC CAC TCC COC GOT GCC erc CCT TCC err CCC TOC 

OGC CCC GOA CCA TAT TTA TTO CAT GCG CCC COG COO CCC CCC ATC CCG AOC CCA 
OGC TOO OCT GGG CTG GAA COC GOT erc TTT AGC TCC erc erc Trc err TOT ATA 

Trr CCT ACC TTO TAC ACA OCT err CCA GAO CCO err CCA Trr TCT ATA erc GAA 

CCA AAC AGC AAT AAA OCA OTA ACC AAG OAC CCC OAC CCC OCT OCT CTC TTC TGC 
CCC TOC ACA AGO ACC TGO ATC CTO COC CCO ero GOT GOA GOA GCC AGA AAG GOC 
CAC CCT CAC ACA GOT OCA GAG OCT TOO ACC TGC CTC CCT CCC CAG TCC CAO AAA 
CAO ATC AGC AAG AGO TCA GOT ATC Trr CAT AAC TAA AAA Trr ATT AAG GAA ACA 
AAA CCA GTO CTG CAA ACO OGA CAO AAA OGA GAO CTO GOT erc CCT CCC OAC CAC 
CCA crc ATC OGC err CCA OCT OOO OAO AOA ATCTTA AAO OAO AGO CCG GOG ACC 
ero TAC TCC AAA GAO CCC AOT err CFG AGA erc TAG OOO ACT CCT ACC CCC AAA 
CTA CTC GCC TTO OCT CCC CTA CAC GOT ACC CCA TCO err ero GCA TAG TCC TGO 
OCC TCA GOO AGO OCA OAO ero COC ACC CAT CCT CCA GOC AGO era TOC AOT CAO 
GCC ATO OGC TCT GOO CTA TCC CCC ACT OCT CCC ATT AAG ATT TGC C C C TGO erc 
CACCGAAAACCCCCTCrrCCCCTAAOr FIG 
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